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General Description D ^ 



DKA pVR 1012-GP(IC) 
Local object 

Created: 09/14/98 04:17PM 
Last Modification Date: ? (no data) 
length: 7003 bp 
storage type: Basic 
fom: Circular 
Cosoaents 

Restriction Map 
BgllMsite 

ClaMsite £g»* 
Drain: 1 site g*Sgg 
EcoRV: 1 site ^trtc 
HindUl: 1 site 
Hpal: 1 site 



KasMsite «*ggg 

Kpnl: 1 site gg&gg 

Narhlsite ggggjg 

PmlMsite 

Pstl: 1 site gg»g 

Pvul- 1 site cgatcG 

rvui. i sue goitcc 



Sacli: 1 site 



Sail: 1 site g*gg£g 
Xmnl- 1 site gaakknnttc 

Amni. I Site CTTNKNNAAG 

EcoRI:2sites g***g 
Nco1:2sites gg*£gg 
Ndel:2sites g^ 
Sphl: 2 sites 
Xhol: 2 sites g£gg$g 
BamHI: 3 sites ggjg 
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Bell: 3 sites 



TgATCA 



Functional Map 

CDS (4 signals) 

CMV IE 5' UT 

Start: BB6 Ends 1129 
CMV IE INT 

Starts 1130 End; 1840 

TbGH 

Start j 4020 Ends 4 572 

Kanr 

Start: 6068 End: 6690 (Complementary) 
Miscjfeature (2 signals) 
CMV enhancer 

Stare: 248 Endi 885 

GP(IC) 

Start: 187 0 End: 4 019 

Annotations 
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2COCGCGTTT 
AGCGCGCAAA 


CGGTCATGAC GGTGAAAACC 
GCCACTACTC CCACTTTTGG 


TCTGACACAT 
A&&CTGTG7A 


CCAGCTCCCG 
CGTCGAGGGC 




51 


GAGACCOTCA 
CTCTGCCAGT 


CAGCTTGTCT GTAAGCGGAT 
GTCGAACAGA CATTCGCCTA 


CCCCGOAGCA 
CGGCCCTCGT 


GACAAGCCCG 
CTGTTCGGGC 




101 


TCAGGGCGCG 
AOTCCCGCGC 


TCAGCGGGTG TTCGCGGGTG TCGGGGCTGG 
AGTCGCCCAC AACCGCCCAC AGCCCCGACC 


CTTAACTATG 
GAATTGATAC 










Kdel 






151 


CGGCATCAGA GCAGATTCTA CTGAGAGTCC 
GCCGTAGTC? CCTCTAAC&T CACTC7CACG 


ACCATATGCC 
TGGTATACGC 


GTGTCAAATA 
C AC A£i 1 J.AT 




201 


CCGCACAGAT CCCTAAGGAG AAAATACCGC 
GGCGTGTCTA CGCATTCCTC TTTTATGGCG 


ATCAGATTGC 
TACTCTAACC 


CTAT7GGCCA 
GATAACCGGT 




251 


TTGCATACGT 
AACGTATGCA 


TCTATCCATA TCATAATATC 
ACATAGGTAT AG7ATTATAC 


TACATTTATA 
ATGTAAA7AT 


TTGCCTCATG 
AACCGAGTAC 




301 


TCCAACATTA CCGCCATGT7 GACATTGATT ATTGACTAG7 
AGG7TGTAAT CGCGGTACAA CCGTAACTAA TAAG7GATCA 


TATTAATAGT 
ATAATTATCA 




351 


AATCAATTAC GGGGTCATTA CTTCATAGCC CATATAYCCA GT7CCGCGTT 
TCAGTTAATG CCCCAGTAAT CAAGTATCGG GTATATACCT CAAGGCGCAA 




401 


ACATAACTCA CGGTAAATGG CCCGCCTGGC 
TOTATTGAAT GCCATTTACC GGGCGGACCC 


TGACCGCCCA ACGACCCCCG 
ACTGGCGGG? TGCTGCCGGC 




451 


CCCATTGACG 
GGGTAACTGC 


TCAATAATGA CGTATGTTCC 
AGTTATTACT GCATACAAGG 


CATAGTAAGG 
GTATCATTGC 


CCAATAGGGA 
GGTTATCCC? 




501 


CTTTCCATTG 
GAAAGGTAAC 


ACGTCAATCC GTGGAGTATT 
TGCAGTTACC CACCTCATAA 


TACGGTAAAC 
ATGCCATTTG 


TGCCCACTTG 
ACGGGTGAAC 
















551 


CCAGTACATC 
CGTCACCTAO 


AACTGTATCA 7ATGCCAAGT 
TTCACATAGT ATACGGTTCA 


ACGCCCCCTA 
TCCGCGGGAT 


TTGACGTCAA 
AACTCCACTT 




601 


TGACGGTAAA 
ACTGCCATTT 


TGGCCCGCCT GGCATTATGC 
ACCGGGCCGA CCGTAATACC 


CCAGTACA7G 
CGTCATGTAC 


ACCT7ATGGG 
TGGAATACCC 












Ncol 




€51 


ACTTTCCTAC 
TGAAAGGA7G 


TTGGCAGTAC ATCTACGTAT TAGTCATCGC 
AACCGTCATG TAGATCCATA ATCAGTAGCG 


TATTACCATG 
ATAATGGTAC 






JCcoI 










701 


GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC 
CACTACGCCA AAACCCTCAT GTAGTTACCC GCACCTATCG 


GGTTTGACTC 
CCAAACTCAC 




751 


ACGGCGATTT CCAAGTCTCC ACCCCATTGA CCTCAATGGG AGTTTGTTTT 
TCCCCCTAAA GCTTCAGAGG TGGGGTAACT GCAGTTACCC TCAAACAAAA 





601 GGCACCAAAA 7CAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCCCCCCA 
CCGTCGTTTT AGTTGCCCTC AAAGGTTTTA CACCATTCTT GAGGCGGGGT 
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651 TTGACGCAAA TCCGCGOTAG GCGTGTACGG TGGQAGGTCT ATATAAGCAG 
AACTGCGTTT A CCC GCCATC CCCACATGCC ACCCTCCAGA TATATTCGTC 



SOX 


AGCTCGTT7A 
TCGAGCAAAT 


GTGAACCGTC 
CACTTGGCAG 


AGATCGCCTC 
TCTAGCGGAC 


GAGACOCCAT 
CTCTGCGGTA 


CCACGCTGTT 
GGTGCGACAA 










Saczx 




AACTGGAG37 


TAGAAGACAC CGGGACCGAT CCAGCCTCCG 
ATCTTCTGTG GCCCTGGCTA GGTCGGAGGC 


CGGCCGGGAA 
CCCGGCCCTT 


1001 


CGCTGCAT7G 
CCCACGTAAC 


GAACGCGGAT 
CTTCCGCCTA 


TCCCCCTGCC 
AGGCCCACGG 


AAGAGTGACG 
TTCTCACTGC 


TAAGTACCGC 
ATTCATGGCG 










sphZ 


1051 


CTATAGACTC 
GATATCTGAG 


TATAGGCACA CCCCTTTGGC 
ATATCCGTGT GGGGAAACCG 


7CTTATGCAT 
AGAATACGTA 


GCTATACTGT 
CGATATGACA 


1101 


AAAACCGAAC 


GGCCCTATAC 
CCCGGATATG 


ACCCCCGCTT 
TCCGGGCGAA 


CCTTATGCTA 
GGAATACGA? 


TAGGTGATGG 
ATCCACTACC 


1151 


TATAGCTTAG 
ATA7CGAATC 


CCTATAGGTG 
GGA7A7CCAC 


TGGGTTATTG 
ACCCAATAAC 


ACCATTATTG 
TCG7AATAAC 


ACCACTCCCC 
TGGTGAGGGG 


1201 


TAT^GGTGAC 
ATAACCACTG 


CAfACTTTCC 
CTATGAAAGG 


ACTACTAATC 
TAATGATTAG 


CATAACATCG 
GTATTGTACC 


CTCTTTGCCA 
GAGAAACCGT 


1251 


CAACTATCTC 
GTTCATAGAG 


TATTGGCTAT 
ATAACCGATA 


ATGCCAATAC 
TACGGTTATG 


TCTGTCCTTC 
AGACAGGAAG 


AGAGACTGAC 
7CTCTGACTG 


1301 


ACGGACTCTG 
TCCCTGAGAC 


TATXTTTACA 
ATAAAAATGT 


GGATGGGGTC 
CCTACCCCAG 


CCATTTATTA 
GGTAAATAAT 


TTTACAAATT 
AAATGTTTAA 


1351 


CACATATACA ACAACGCCGT 
GTCTATATGT TGTTCCGGCA 


CCCCCGTGCC 
GGGGGCACGG 


CGCAGTT7TT 
GCGTCAAAAA 


ATOAAACATA 
TAA7TTGTAT 


1401 


GCGTCCGATC 
CCCACCCTAG 


TCCACCCGAA 
ACGTGCGCTT 


CC7CGCGTAC 
AGAGCCCATG 


GTGTTCCGCA CATGGGCTC? 
CACAAGGCCT GTACCCGACA 


1451 


TCTCCGGTAG 
AGAGGCCATC 


CGGCGGAGCT TCCACATCCG ACCCCTGGTC 
GCCGCCTCGA AGGTGTAGGC TCGGGACCAG 


CCATGCCTCC 
CGTACGGAGG 


1501 


AGCGCCTCAT GGTCGCTCGG 
TCGCCGAGTA CCAGCGAGCC 


CAGCTCCTTG 
GTCGAGGAAC 


CTCCTAACAG 
GAGGATTGTC 


TGGAGGCCAG 
ACCTCCGCTC 


1551 


ACTTAGGCAC 
TCAATCCGTG 


AGGACAATGC 
TCGTGTTACC 


CCACCACCAC 
CGTGGTGGTG 


CAGTCTCCCG 
GTCACACGGC 


CACAAGGCCO 
GTCTTCCGGC 


1601 


TGGCGGTAGG 
ACCGCCATCC 


GTATGTGTCT GAAAATGAGC 
CATACACAGA CTTTTACTCG 


GTGGAGATTG 
CACCTCTAAC 


GGCTCGCACG 
CCGAGCGTGC 



1651 GCTGACGCAG ATGGAAGACT TAAGGCAGCG GCAGAAGAAG ATGCAGGCAG 
CGAC7GCCTC TACCTTCTCA ATTCCGTCGC CGTCTTCTTC TACGTCCGTC 



1701 CCGAGTXCTT GTATTCTGAT AAGAGTCAGA GGTAACTCCC CTTGCGGTGC 
GACTCAACAA CATAACACTA TTCTCAGTCT CCATTGACGG CAACGCCACO 
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Kp*T 

1751 TGTTAACGGT GGAGGCCAGT GTAGTCTGAG CACTACTCG7 TGCTGCCCCG 
ACAATTGCCA CCTCCCGTCA CATCAGACTC GTCATGAGCA ACGACGGCGC 



MCOI 



1801 CGCGCCACCA GACATAATAG CTGACAGACT AACAGACTCT TCCTTTCCAT 
GCGCGGTGGT CTCTATTATC GACTGTCTGA TTG7CTGACA AGGAAAGCTA 



Sail 

Ncol P«tl Pmll Bell ECORV 



1851 GGGTCTTTTC TGCAGTCACC GTCGTCGACA CGTOTGATCA GATATCGCGG 
CCCAGAAAAG ACGTCAGTGG CAGCAGCTGT GCACACTAGT CTATAGCGCC 



ECORI 

1901 CCGCCCGGCC GCTCTAGAAT TCTCTAATCA CAGTCATCAT GGGAGCGTCA 
GiCGCGCCGG CCAGATCTTA AGAGATTAGT GTCAGTAGTA CCCTCGCAGT 



'951 GCGATTCTGC AATTGCCCCG TGAGCGCTTC ACGAAAACAT CTTTCWTGT 
CCCTAAGACG TTAACGGGGC ACCCGCGAAG TCCTTTTGTA CAAAGAAACA 



2001 TTGGGTAACA ATCCTATTCC ATAAAGTCrT TTCAATCCCG TTGGGGGTTG 
AACCCATTAT TAGGATAAGG TATTTCAGAA AAGTTAGGGC AACCCCCAAC 



2051 TACACAACAA TACCCTACAA GXGAGTGATA TCGACAAGTT TCTGTGCCGA 
ATGTGTTCTT ATGGGATGTT CACTCACTAT AACTGTTCAA ACACACGGCT 



2101 GACAAACTCT CTTCAACTAG CCAATTGAAG TCAGTCGGGT TCAACTTGGA 
CTGTTTGAGA GAACTTGATC GGTTAACTTC AGTCAGCCCA ACTTGAACCT 



2151 GGGCAATGGA CTAGCAACTO ATCTACCAAC GGCAACCAAA AGATGCCGTT 
CCCGTTACCT CATCGTTGAC TACATGGTTG CCGTTGGTTT TCTACCCCAA 



2201 TTCGAGCXGG TGTTCCACCA AAGGTGGTAA ATTACGAAGC CGGAGAATGG 
AAGCTCCACC ACAAGCTGGT TTCCACCATT TAATGCTTCG ACCTCTTACC 



2251 GCTGAGAACT GTTATAACCT GGCTATAAAG AAAGTTGATG GTAGTGAGTG 
CGAC7C7TGA CAATATTGGA CCGATATTTC TTTCAAGTAC CATCACTCAC 



2301 CCTACCAGAA GCCCCTGAGG GAGTGAGGGA TTTTCCCCGT TGCCGCTATG 
GGATGGTCTT CGGGGAGTCC CTCACTCCCT AAAAGGGGCA ACGGCGATAC 



2351 TACACAAAGT CTCAGGAACT GGACCATCCC CAGGAGGACT CGCCTTTCAC 
ATGTGTTXCA GAGTCCTTGA CCTGGTACGG GTCCTCCTGA GCCCAAAGTG 



2401 AAAGAAGGAG CCTTCTTCCT GTATGACCCA CTCGCATCAA CAATCATTTA 
TTTCTTCCTC GGAAGAAGGA CATACTGGCT GAGCGTAGTT GTTAGTAAAT 



2451 TCGGGGTACA ACCTTTGCCG AAGGAGTTAT TCCATTTCTC ATCTTGCCTA 
ACCCCCATGT TGGAAACGGC TTCCTCAATA ACGTAAAGAC TAGAACGGAT 



2501 AGGCGCGAAA GGATTTTTTC CAGTCTCCTC CATTGCATGA GCCTCCCAAC 
TCCGCGCTTT CCTAAAAAAG GTCAGAGGAG GTAACGTACT CGGACGGTTG 
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BaznJZX 

255X ATOACCACGG ATCCCTCCAG TTACTATCAC ACGACAACAA TAAACTACOT 
TACTCCTCCC TAGGGAGGTC AATGATAGTG TGCTCTTGTT ATTTGATGCA 



2501 GGTTGATAAT TCTGGAACCA ACACCACAGA GTTTCTCTTC CAAGTCGATC 
CCAACTATTA AAACCTTGGT TGTGGTCTCT CAAAOACAAG GTTCAGC*AG 



Xfaol 

2651 ATTTGACGTA TGTGCAGCTC GAGGCAAGAT TCACACCACA ATTCCTTGTC 
TAAACTCCAT ACACGTCGAG CTCCGTTCTA AGTCTCGTGT TAAGGAACAG 



2701 CTCCTAAATG A AACC A TCTA CTCTGATAAC CGCAGAAGTA ACACAACAGG 
GAOGATTTAC TTTGGTACAT GAGACTATTG CC C TCT T CAT 7GTGTTGTCC 



2751 AAAACTAATC TGGAAAATAA ATCCCACTGT TCATACCAGC ATGGGTGAGT 
TTTTGATTAG ACCTTTTATT TAGCGTGACA ACTATGGTCG TACCCACTCA 



2801 CGCCTTTCTG GGAAAATAAA AAAACT7CAC AAAAACCCTT TCAAGTGAAG 
CCCGAAAGAC CCTTTTATTT TTTTGAAGTG TTT7TGGCAA AGTCCACTTC 



2851 AGTTGTCCTT CCTACCTGTA CCAGAAACCC AGAACCACGT CCTTGACACG 
TCAACAGAAA GCATGGACAT GGTCTTTGGG TCTTGCTCCA GGAACTGTGC 



2901 ACASCGACGG TCTCTCCTCC CATCTCCGCC CACAACCACG CAGGCGAAGA 
7GTCGCTCCC AGAGAGGAGC GTAGAGGCGG GTCTTCGTGC GICCGCT3C? 



2951 CCACAAAGAA TTGGTMCAG AGGATTCCAC TCCAGTGGT7 CAGATGCAAA 
GGTGTTTCTT AACCAAAG7C TCCTAAGGTG AGGTCACCAA GTCTACGTTT 



3001 ACATCAACGG AAAGGACACA ATGCCAACCA CAGTGACGGG TGTACCAACA 
TCTACTTCCC TTTCCTGTGT TACCCTMGT GTCACTGCCC ACATGGTTGT 



Bell 

3051 ACCACACCCT CTCCATTTCC AATCAATGCT CCCAACACTG ATCATACCAA 
TGGTCTGGGA GAGGTAAAGG T7AGTTACCA GCGTTGTGAC TACTATGGTT 



3101 ATCATTTATC GGCCTGGAGG GGCCCCAAOA AGACCACAGC ACCACACAGC 
TAGTAAATAG CCGGACC7CC CCGGGGTTCT TCTGGTGTCG TGGTCOGTCG 



3151 CTCCCAAGAC CACCAGCCAA CCAACCAACA GCACAGAATC GACGACACTA 
GACGGTTCM GTGGTCGGTT GGTTGGTTGX CGTGTCTTAG CTGCTGTGAT 



3201 AACCGAACAT CAGAGCCCTC CAGTAGAGGC ACGGOACCAT CCAGCCCCAC 
TTCGGTTGTA GTCTCGGGAG GTCATCTCCG TGCCCTGGTA GGTCGGGGTC 



3251 GGTCCCCAAC ACCACAGAAA QCCACGCCGA ACTTGGCAAG ACAACCCCAA 
CCAGGGGTTG TCGTGTCTTT CGGTGCOOCT TGAACCCMC TCTTGGGCT* 



3301 CCACACTCCC AGAACAGCAC ACTGCCGCCA GTCCCATTCC AAGAGCCCTO 
GGTGTGAGGG TCTTCTCGTG TGACGGCGGT CACGGTAAGG TTCTCGGCAC 



3351 CACCCCGACG AACTCAGTGG ACCTGGCTTC CTGACGAACA CAATACGGGG 
GXGGGGCTCC TTGAGTCACC TGGACCGAAG CACTGCTTGT GTTATGCCCC 
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BamHI 

3401 GGTGACAAAT CTCCTQACAG GATCCAGAAG AAAGCGAAGG GATCTCACTC 
CCACTCTTTA GAGGACTCTC CTAGGTCTTC TTTCGCTTCC CTACAGTGAG 

3451 CCAATACACA ACCCAAATGC AACCCAAACC TGCACTATTG GACACCCTTG 
GC7TATGTGT TGGCTTTACG TTGGGTTTGG ACCTGATAAC CTGTCGGAAC 

3501 GATGAGGGTG CTGCCATAGG TTTAGCC7GG ATACCATACT TCGGGCCAGC 
CTACTCCCAC GACGGTATCC AAATCCCACC TATGGTATGA AGCCCGGTCG 

3551 AGCTGAGGGA ATTTACACTG AAGGCATAAT CCAGAA5JCAA AATGGATTGA 
TCGACTCCCT TAAATGTGAC MCCCTATTA CCT C TT A CTT TTACCTAACT 

3601 TCTGTGGATT GAGGCAGCTG GCCAACCAAA CGACACAAGC TCTTCAATTG 
AGACACCTAA CTCCGTCGAC CGCTTCCTTT GCTGTGTTCG AGAAGTTAAC 
. . . i 

3651 TTCTTAAGCG CAACTACTGA GTTGCCTACA TTCTCTATAC TAAATCGGAA 
AAGAATTCCC GV7GATGACT CAACGCATGT AAGAGATATG ATTTAGCCIT 

3701 AGCAATAGAC TTCTTGCTCC AAAGATGGGG AGGAACATGT CACATTCTAG 
TCCTTATCTG AAGAACGACG TTTCTACCCC TCCTTGTACA CTGTAAGATC 

3751 GGCCTGA2TG TTGCATTGAA CCCCAAGATT GGACCAAAAA TATCACTGAT 
CCGGACTAAC AACGTAACTT GGGGTTC7AA CCTCGTTTTT ATAG7GACTA 

Bell 

3801 AAAATTGASC AAATAATCCA TGACTTTCTC GATAATAATC TTCCAAATCA 
TTTTAACTAG TTTATTAGGT ACTGAAACAG CTATTATTAG AAGGTTTAGT 

3851 GAA7CA7CGC AGCAAC7GCT CGACTGGATG GAAACAATGG GTTCCTGCTG 
CTTACTACCG TCCTTGACCA CCTGACCTAC CrCTGTTACC CAAGGACCAC 

3901 GAATAGGAAT CACACGAGTA ATCATTGCTA TTATTGCTTT GCTGTCCATT 
CTTATCCTTA GTGTCCTCAT TAGTAACGAT AATAACGAAA CCACACGTAA 

EcoRI 

3951 TCCAAATTCA TGCTTTGAAC TAATATAGCA TCATACTTTA GAATTCTAGA 
ACGTTTAAGT ACGAAACTTG AWATATCGT AGTATGAAAT CTTAAGATCT 

Narl 

XftSl BamKI BfflXI 

4001 CCAGGCCCCT GGATCCAGAT CTCCTGTGCC TTCTAGTTGC CAGCCA7CTG 
GGTCCGCGGA CCTAGGTCTA GACGACACGG AAGATCAACG GTCGG7AGAC 

4051 CTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG TGCCACTCCC 
AACAAACGGG GAGGGGGCAC GGAACGAACT GGGACCTTCC ACGGTGAGGG 

4101 ACTGTCCTTT CCTAATAAAA TCAGGAAATT GCATCGCATT CTCTGAGTAG 
TCACAGGAAA GGATTATTTT ACTCCTTTAA CGTAGCGTAA CAGACTCATC 



4151 GTGTCATTCT ATTCTGGGGG GTG6GGTCGG GCAGCACAGC AAGGGGGAGG 
CACAGTAAGA TAAGACCCCC CACCCCACCC CGTCGTCTCC TTCCCCCTCC 
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4201 


ATTCSGAAGA CAATAGCAGG 
TAJLCCCTTCT GTTATCGTCC 


SptH Kpnl 

CATGCTCGGG ATGCGGTGGG CTCTATGGGT 
GTACCACCCC TACGCCACCC GAGATACCCA 






jqpnl 






4251 


ACCCACGTGC TGAACAATTG ACCCGGCTCC TCCTGGGCCA GAAAGAAGCA 
TCGGTCCACC ACTTCTTAAC TGGGCCAAGG AGGACCCCCT CTT7CCTCGT 




4301 


CCCACATCCC CTTCTCTGTG 
CC6TGTAGGG GAAGAGACAC 


ACACACCC7G TCCACGCCCC TGGTTCTTAG 
TGTGTGGGAC AGGTGCCGGG ACCAAGAATC 




4351 


TTCCAGCCCC ACTCATAGGA CACTCATAGC TCAGCAGCCC TCCGCCTTCA 
AAGGTCGGCC TGAGTATCCT CTGAGIATCG ACTCCTCCCG AGGCGGAAGT 




4401 


A2CCCACCCC CTAAAGTACT TCGAGCGGTC TCTCCCTCCC TCATCAGCCC 
TAGGCTCGGC GATTTCATCA ACCTCGCCAG AGAGGGAGGG AGTAGTCGGG 




4451 


ACCAAACCAA ACCTAGCCTC 
7GGTTTGGTT TGGATCGGAG 


CAAGAGTGGG AAGAAATTAA AGCAAGATAG 

GrrCTCACCC ttctttaatt tcgttctatc 




4501 


GCTATTAAGT GCAGAGGGAG 
CGATAATTCA CGTCTCCCTC 


AGAAAATCCC TCCAACATGT CACGAAGXAA 
TCTTTTACGG AGGTTGTACA C7CCTTCATT 






Xoal 




4551 


TGAGAGAAAT CATASAATTT 
ACTCTCTTTA GTATCTTAAA 


CTTCCCCTTC CTCGCTCACT GACTCGCTGC 
GAAGGCGAAG GAGCCAGTGA CTGAGCGACG 




4601 


CC7CGGTCGT TCGGCTGCGG 
CGAGCCAGCA AGCCGACGCC 


CGAGCGGTAT CAGCTCACTC AAAGGCGGTA 
CCTCGCCATA GTCGAGTGAG TTTCCSCCAT 




4651 


ATACGGTTA? CCACAGAATC 
TATGCCAATA GGTGTCTTAG 


AGGGGATAAC GCAGGAAAGA ACATGTGAGC 
TCCCCTATTG CCTCCTTTC7 TG7ACACTCG 




4701 


AAAAGGCCAG CAAAAGGCCA 
TTTTCCCGTC GTTTTCCGGT 


GGAACCGTAA AAACGCCGCG TTGCTGCCGT 
CCTTGGCATT TTOCCGGCGC AACGACCGCA 




4751 


TTWCCATAG GCTCCGCCCC 
AAAAGGTATC CGAGGCGGGG 


CCTGACGAGC ATCACAAAAA TCGACGCTCA 
CCACTGCTCG TAGTCTKTT AGCTGCGAGT 




4801 


AGTCAGAGGT GGCGAAACCC 
TCAGTCTCCA CCGCTTTGGG 


GACAGGACTA TAAAGATACC AGGCGTCTCC 
CTGTCCTGAT ATTTCTATGG TCCCCAAAGG 




4SS1 


CCCTGGAAGC TCCCTCGTGC 
GGGACCTTCG AGGGACCACG 


CCTCTCCTGT TCCGACCCTG CCGCTTACCG 
CGAGAGGACA AGGCTGGCAC GGCGAATGGC 




4901 


GATACCTGTC CGCCTTTCTC 
CTATGGACAG GCGGAAAGAG 


CCTTCGGGAA GCCTGGCGCT TTCTCAATGC 
GGAAGCCCTT CGGACCCCCA AAGAGT7ACC 





4951 TCACGCTGTA CGTATCTCAG TTCGGTGTAG CTCGTTCGCT CCAAGCTGGG 
AGTGCGACAT CCATACACTC AAGCCACATC CAGCAAGCGA GGTTCGACCC 



5001 CTGTCTCCAC GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGCTA 
GACACACGTG CTTGGGGGGC AAGTCGGGCT GGCGACGCGG AATAGGCCAT 



Q '9 'S A. 3 «3 O 'Si „, O Si JL "7 O JL 





WO 99/32147 

9 




PCT/US98/27364 


5051 


ACTATCCTCT TGA6TGCAAC CCGGTAAOAC ACGACTTATC 
TGAXAGCA6A ACTCACCTTG GGCCATTCTG TGCTGAATAG 


GCCACTGGCA 
CGGTGACCGT 




5101 


GCACCCACTG 
CCTCGGTGAC 


GTAACAGGAT TAGCAGAGCG AGGTATGTAG 
CATTGTCCTA ATCGTCTCGC TCCATACATC 


GCGGTGCTAC 
CGCCACGATG 




5151 


AGAG7TCTTG 
TCTCAAGAAC 


AACTGGTGGC CTAACTACGG CTACACTAGA 
TTCACCACCG GATTGATGCC CATC7GATCT 


AGGACAG7AT 
TCCTGTCATA 




5201 


TTGGTATCTG 
AACCATAGAC 


CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA 
GCCXCACGAC TTCGGTCAAT GGAAGCCT7X 


AAGAGTTCGT 
TTCTCAACCA 




5251 


AGCTCTTGAT 
TCGAGAACTA 


CCGGCAAACA AACCACCGCT GGTAGCGGTG 
GGCCGTTTGT TT6GTGGCGA CCATCGCCAC 


GTWTTTTGT 
CAAAAAAACA 




5301 


TTGCAAGCAG 
AACGTTCGTC 


CAGATTACGC GCAGAAAAAA AGSATCTCAA GAAGATCCTT 
GTCTAATGCG CGTCTTTTTT TCCTAGAGTT CTTCTAGGAA 




5351 


TCATCTTTTC 
ACSAGAAAAG 


TACGGGGTCT GACGCTCAGT GGAACGAAAA 
ATGCCCCAGA CTGCGAGTCA CCTTCCTCTT 


CTCACGT7AA 
GAGTGCAATO 




£401 


GGGATTTTGG 
CCCTAAAACC 


TCATGAGATO ATCAAAAACC ATCTTCACCT 
ACTACTCTAA TACTTTTTCC TAGAAGTCGA 


ACATCCTTTT 
TCTAGGAAAA 




5451 


AAATTAAAAA 
TTTAATTTTT 


TGAAGTTTTA AATCAATCTA AAGTATATAT 
ACTTCAAAAT TTAGT7AGAT TTCATATATA 


GAGTAAACTT 
CTCATTTGAA 




5501 


CGTCTGACAG 
CCAGACTCTC 


TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC 
AATGGTTACG AATTAGTCAC TCCGTGGATA GAGTCGC7AG 




5551 


**GTCTATTTC 
ACAGATAAAG 


GTTCATCCAT AGTTGCCTGA CTCCGGGGGG 
CAAGTAGSTA TCAACGGACT GAGGCCCCCC 


GGGGGGCGCT 
CCCCCCGCGA 




5601 


GAGG7CTGCC 
CTCCAGACGG 


TCG7CAACAA GGTGTTGCTG ACTCATACCA 
ACCACTTCTT CCACAACGAC TGACTATGGT 


GGCCTGAATC 
CCGGACTTAG 




5651 


GCCCCATCAT 
CGGGGTAGTA 


CCAGCCAGAA AGTGAGGGAG CCACGCTTGA 
GGTCGGTCTT TCACTCCCCC GGTGCCAACT 


TGAGAGCTTT 
ACTCTCGAAA 




5701 


GTTGTAGGTG 
CAACATCCAC 


GACCAGTTGG TGATTTTGAA CTTTTGCTTT 
C7GGTCAACC ACTAAAACTT GAAAACGAAA 


GCCACGGAAC 
CGGTGCCTTG 




5751 


CCAGACGOA 


GTCCGGAAGA TGCCTCATCT GATCCTTCAA 
CAGCCCTTCT ACGCACTAGA CTAGGAAGTT 


CTCAGCAAAA 
GAGTCCTTTX 




5801 


GTTCGATTTA TTCAACAAAG CCGCCGTCCC CTCAAGTCAG 
CAACCTAAAT AAG7TGT77C GGCCCCAGGQ CAGTTCACTC 


CCTAATCCTC 
GCATTACGAG 




5B51 


TGCCAGTGTT ACAACCAATT AACCAATTCT CACTAGAAAA 
ACCCTCACAA TGTTGGTTAA TTGGTTAAGA CTAATCTTTT 


ACTCATCCAG 
TGAGTAGCTC 




5901 


CATCAAATGA AACTGCAATX TATTCATATC AGGATTATCA ATACCATATT 
GTAGTWACT TTGACCTTAA ATAAGTATAG TCCTAATAGT TATGCTATAA 




5951 


TTTGAAAAAG 
AAACTTTTTC 


CCGTTTCTGT AATGAAGGAG AAAACTCACC GAGGCAGTTC 
GGCAAACACA 77ACTTCCTC OTTTGAGTGG CTCCGTCAAG 
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6001 CATOGGATGG CAAGATCCTG GTATCGGTCT GCGATTCCGA CTCGTCCAAC 
GTKXCCTACC GTTCTAGGAC CATAGCCAGA CCCTAACGCT GAGCAGCTTG 



6051 ATCAATACAA CCTATTAATT TCCCCTCOTC AAAAA9AACG CTAT CAA OTC 
TAGTTATCTT GGATAATTAA AGGGGAGCAG TTTTTATTCC AATAGTTCAC 



Hindzil 



6101 AGAAASCACC ATGAGTGACG ACTCAATCCG GTGAGAATCG CAAAAGCTTA 
VllV.A CTGG TACTCACtCC TGACTTAGGC CACTCTTACC GTTTTCCAAT 



6151 ^GCATTTCTT TCCAGACTTG TTCAACAGGC CAGCCATTAC GCTCGTCATC 
ACGTAAAGAA AGGTCTCAAC AAGTTGTCCG GTCGGTAATG CGAGCAGTAG 



6201 AAAATCACTC GCATCAACCA AACCGTTATT CATTCGTGAT TGCGCCTCAG 
TTCTAGTGAG CGTAGTTGGT TTGGCAATAA CTAAGCACTA ACCCGGACTC 



PVUZ 

6251 CCAGACGAAA TACGCGATCG CTCTTAAAAG GACAATOACA AACAGGAATC 
GCTCTGCTTT ATGCGCTAGC GACAATTTTC CTGTTAATGT TTGTCCTTAG 



6301 GAATGOACC GGCGCAGGAA CACTGCCAGC GCA2CAACAA TATTTOCACC 
CTOCCTTGG CCGCGTCCTT GTGACCGTCG CGTAGXTGTT ATAAAAGTGG 



6351 TGRATCAGGA TATTCTTCTA ATACCTGGAA TCCTGTTTTC CCGGGGATCG 
ACTTAGTCCT ATAAGAAGAT TATG6ACCTT ACGACAAAAG GGCCCCTAGC 



6401 CAGXCCTGAG TAACCATGCA TCATCAGGAG TACGGATAAA ATGCTTGATG 
CTCACCACTC A7TGGTACGT AGTAGTCCTC ATGCCTATTT 7ACGAACTAC 



6451 G7CGGAAGAG GCATAAATTC CGTCAGCCAG TTTAGTCTGA CCATCTCATC 
CAGCCT3CTC CGCATTTAAG GCAGTCGGTC AAATCAGACT CCTAGAGTAG 



6501 TGTAACATCA TTGGCAACCC TACCTTTGCC ATGTTTCAGA AACAACTCTG 
ACATTCTAGT AACCCTTGCG ATGGAAACGG TACAAAGTCT TTGTTGAGAC 



Clal 



6551 GCGCATCGGG C CT C CCATAC AATCGATAGA CTGTCCCACC TGATTGCCCG 
CCCGTAGCCC CAAGGGTATC TTAGCTATCT AACAGCGTGG ACTAACGGGC 



6601 ACATTATCCC GAGCCCATTT ATACCCATAT AAATCAGCAT CCATGTTGGA 
TGXAA7AGCG CTCGSGTAAA TATGCCTATA TTTAGTCCTA GGTACAACCT 



Xhol 



6651 ATTTAATCGC CCCCTCGAGC AAGACGTTTC CCCTTGAATA TGCCTCATAA 
TAAATTAGCG CCGGAGCTCG TTCTGCAAAG GGCAACTTAT ACCGAGTATT 



6701 CACCCCTTGT ATTACTCTTT ATGTAACCAG ACAGTTTTAT TGTTCATGAT 
GTCGGQAACA TAATGACAAA TACATTCGTC TGTCAAAATA ACAAGTACTA 



Drain 



6751 GATATATTTT TATCTTCTCC AATGTAACAT CAGAGATTTT GAGACACAAC 
CZA7ATAAAA ATACAACACG TTACATTGTA GTCTCTAAAA CTCTCTCTTG 
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6801 


Drain 

CTCCeTTTCC CCCCCCCCCC ATTATTGAAG CATTTATCAG GGTTATTOTC 
CACCGAAAGG GGQG6GGG6G TAATAACTTC OTAAATAGTC CCAATAACAC 




6851 


CCATGAGCCC ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGG 
AGXACTCGCC TATGTATAAK CTTACATAAA TCTTTTTATT TGTTTATCCC 




63 01 


GT7CCCCCCA CATTTCCCCC AAAACTCCCA CCTGACGTCT AAGAAACCAT 
CAABgCOCGT gTAAACCCGC TTTTCACGGT GGACTGCACA TTCTTTGCjTA 




6551 


TA2TATCATG ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTTTC 
AX&ATAGTAC TCTAATTGGA TATTTTTATC CCCATAfiTCC TCCGGGAAAQ 




7001 


GTC 
CAS 
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pVR 1012 -CP (s) "SBCi \±er\CJL~ 

General Description 0 

DKX pVR 1012 -GKS) 
Local object 

Created* 09/14/98 03:58PM 
Last Modification Date: ? <ao data) 
lenatt* 7073 bp 
storage types Basic 
form: Circular 
Ccoroents 

Restriction Map 
Ball: 1 site 



Bell: 1 site 
Clainsite *jggg» 
Dralll: 1 site C C S 
Hindlll: 1 site 
K&sl: t site ccgcgc 
Kpnl: 1 site gggg: 
Narl: 1 site ggggj 
PmlMsite £*gg° 



Pvul: 1 site 

Sacll:1site §°°gg 

Sail: 1 site gregg 

Xba»:1site J«aga 

Y mA i, i GAANKNNTTC 

Amni. i sue cotnnhnaag 
Ndel:2site$ gggg 
EcoRV:3sites <»$*g 
Sphl:3sites g££gg 
Ncol: 4 sites §g**g 
BamHl: 6 sites 

Functional Map 
CDS (4 signals) 
CMV IE 5* UT 
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Start: 636 Ends 1129 
CMV IE INT 

Start: 1130 Ends 1B40 

TbGH 

Start: 4090 End: 4642 

Kan r 

Start: 613 8 End; 6760 (Ccrapl errant a ry) 

Miscjteature (2 signals) 

CMV enhancer 

Starts 248 End J 885 
QP(S) 

Start: 1370 End: 4089 



Annotations 
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1 TCCCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCC 
A3CGCGCAAA GCCACTACTG CCACTTTTGG AGACTGTGTA CGTCGAGGGC 



51 GAGACGGCCA CAGCTTGTCT GTAAGCGGAT GCCGCGAGCA GACAAGCCCG 
CTCTGCCAGT GTCGAACACA CATTC GCCTA CGGCCCTCGT CTGTTCGGGC 



101 -"CTiGGGCCCG TCAGCGGGTG TTGCCGGGTG TCCGGGCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCGCCCAC AGCCCCGACC CAATTGATAC 



Ndel 



151 CGGCATCASA CCAGATTCTA CTGAGACTGC ACCATATGCG GTGTGAAATA 
CCCGTAGTCT CGTCTAACAT GACTCTCACG TGCTATACGC CACACTTTAT 



Ball 



201 CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGATTGG CTATTGGCCA 
GCCGTGTCTA CGCATTCCTC TTTTATGGCG TAGTCTAACC GATAACCGGT 



251 TTGCATACGT TGTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 
AACGTATGCA ACATAGGTAT AGTATTATAC ATGTAAATAT AACCGAGTAC 



301 TCCAACATTA CCGCCATGTT GACATTGATT ATTGACTACT TATTAATAGT 
AGGTTGTAAT GCCGGTACAA CTGTAACCAA TAACTCATCA ATAATTATCA 



*«=1 \ATCAATTAC GGGGTCATTA G7TCATAGCC CATATATGGA GTTCCGCGTT 
TTAGTTAATG CCCCAGTAAT CAAC7ATCGG CTATATACCT CAAGGCGCAA 



40" ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 
7CTATTGAAT GCCATTTACC GGGCGGACCG ACTGGCGGGT TGCTGGGGGC 



451 CCCATTGACG TCAACAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 
GGGXAACTGC AGTTATTACT GCATACAAGG GTATCATTGC GGTTATCCCT 



501 CTTTCCATTG ACGTCAATGG GTGGAGTATT TACCGTAAAC TGCCCACTTG 
GAAACCTAAC TCCAGTTACC CACCTCATAA ATGCCATTTG ACGGGTGAAC 



Ndel 



551 GCAGTACA1C AAGTGTATCA TATGCCAACT ACGCCCCCTA TTGACGTCAA 
CGTCATGTAG TTCACATAGT ATACGGTTCA TGCGGGGGAT AACTGCAGTT 



601 TCACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG ACCTTATGGG 
ACTGCCATTT ACCGGGCGGA CCGCAATACG GGTCATCTAC TGGAATACCC 



NCOI 

fiSl ACTTTCC7AC TTGGCAGTAC ATCTACCTAT TAGTCATCGC TATTACCATG 
TGAAAGCATG AACCGTCATG TAGATGCATA ATCAGTAGCC ATAATGGTAC 



Ncol 

701 GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATACC GGTTTCACTC 
CACTACGCCA AAACCGTCAT GTAGTTACCC GCACCTATCG CCAAACTGAG 



751 ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT 
TGCCCCTAAA GGTTCAGAGG TGGGGTAACT GCAGTTACCC TCAAACAAAA 



o '9 '9 :i is gi o «;;;& » 01 eh .a. y o ji„ 



WO 99/32147 PCT/US98/27364 



SOI 


CGCACCAAAA 
CCGTGGTTTT 


TCAACGGGAC 
AGTTGCCCTG 


TTTCCAAAAT 
AAAGGTTTTA 


GTCGTAACAA 
CAGCACTGTT 


CTCCGCCCCA 
GAGGCGGGGT 


851 


TTGACGCAAA tcggcggtag 
AACTGCGTTX ACCCGCCATC 


GCGTGTACGC 
CGCACATGCC 


TGGGAGGTCT 
ACCCTCCAGA 


ATATAAGCAG 
TATATTCGTC 


501 


AGCTCGTTTA 
TCGAGCAAAT 


CTOAACCGTC 
CACTTGGCAG 


AGATCGCCTG 
TCTACCCGAC 


GAGACGCCAT CCACGCTGTT 
CTCTCCGGTA GGTGCGACAA 










sacXJ 


951 


TTGACCTCCA 
AACTGGAGGT 


TAGAAQACAC 
ATCTTCTGTC 


CG3GACCCAT 
GCCCTGGCTA 


CGAGCCTCCG 
GGTCGGAGGC 


CGGCCCCGAA 
GCCGGCCCTT 


1001 


CGGTGCATTG 
GCCACGTAAC 


GAACGCGGAT 
COTGCGCCTA 


TCCCCGTGCC 
AGGG3CACGG 


AAGAGTGACG 
TTCTCACTGC 


TAAGTACGGC 
ATTCATCGCG 










SphI 


1051 


CTATAGACTC 
GATATCTGAG 


TATAGGCACA 
ATATCCGTGT 


CCCCTTTGGC 
GGGGAAACCG 


TCTTATGCAT 
AGAATACGTA 


GCTATACTGT 
CGATATGACA 


1101 


TTTTGGCTTG 
AAAACCGAAC 


GGGCCTATAC 
CCCGGATATG 


ACCCCCGCTT 
TGGGGGCGAA 


CCTTATCCTA 
GGAATACGAT 


TAGGTCATGG 
ATCCACTACC 


1151 


TATAGCTTAG 
ATATCGAATC 


CCTATAGGTG 
GGATATCCAC 


TGGGTTATTG 
ACCCAAvAAC 


ACCATTATTG ACCACTCCCC 
TGGTAATAAC TGGTGAGGGG 


1201 


7ATTCGTGAC 
ATAACCACTC 


GATACTTTCC 
CTATGAAAGG 


ATTACTAATC 
TAATGACTAG 


GATAACACGG 
GTATTCTACC 


CTCTTTGCCA 
GAGAAACGGT 


1251 


CAACTATCTC 
GTTGATAGAG 


TATTGGCTAT 
ATAACCGATA 


ATGCCAATAC 
TACCGTTATG 


TCTGTCCTTC 
AGAGAGGAAG 


AGAGACTGAC 
TCTCTGACTG 


1301 


ACGGACTCTG 
TGCCTGACAC 


TATTTTTACA 
ATAAAAATGT 


GGATGGGGTC 
CCCACCCCAG 


CCASTTATTA 
GGTAAATAAT 


TTTACAAATT 
AAATGTTTAA 


1351 


CACATATACA 
GTGTATATGT 


ACAACGCCGT 
TGTTGCGGCA 


CCCCCCTGCC 
GGGGGCACGG 


CGCAGTTTCT 
GCGTCAAAAA 


ATTAAACATA 
TAATTTGTAT 


1401 


GCGTGGGATC 
CGCACCCTAG 


TCCACGCGAA 
AGCTGCGCTX 


TCTCGGGTAC 
AGAGCCCATC 


CTGTTCCGGA 
CACAAGGCCT 


CATGGGCTCT 
GTACCCGAGA 


1451 


TCTCCGGTAG 
AGAGGCCATC 


CGGCGGAGCT TCCACATCCG 
GCCGCCTCGA AGGTG7AGGC 


AGCCCTGCTC CCATGCCTCC 
TCGGGACCAG GGTACGGAGG 


1501 


AGCGGCTCAT GGTCGCTCGG 
TCGCCGAGTA CCAGCGAGCC 


CAGCTCCTTG CTCCTAAC AG 
GTCGAGGAAC GAGGATTGTC 


TGGAGGCCAG 
ACCTCCGGTC 


1551 


ACOTAGGCAC 
TGAATCCGTG 


AGCACAATCC 
TCGTGTTACG 


CCACCACCAC 
GGTGGTGGTC 


CAGTGTGCCG 
GTCACACGGC 


CACAAGGCCG 
GTGTTCCCGC 


1601 


TGGCCGTAGG 
ACCGCCATCC 


GTATGTGTCT 
CATACACAGA 


GAAAATGAGC 
CTTTTACTCG 


GTGGAGATTG 
CACCTCTAAC 


GGCTCGCACG 
CCGAGCGTGC 


1651 


GCTGACGCAG 
CGACTGCGTC 


ATGGAACACT 
TACCTTCTGA 


TAAGGCAGCG 
ATTCCGTCGC 


GCAGAAGAAO 


ATGCAGGCAG 
TACGTCCCTC 
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iifil CTGAGTTGTT GTAOTCTGAT AAGAGTCAGA GCTAACTCCC GTTGCCGTGC 
1701 ^^Sl CATAaLcTA TTCTCACTCT CCATTGAGGG CAACGCCACG ^ 

1751 " "tcctaacggt 'ggagggcagt gtactctgag cagtactcgt tgctgccccc 

ACAA77GCCA CCTCCCGTCA CATCAGACTC GTCATGACCA ACGACCGCGC 

Xcol 

1801 CGCGCCACCA GACATAATAG CTGACAGACT AACAGACTCT ^CTTTCCAT 
GCCCGGTGGT CTGTATTATC GACTGTCTGA TTGTCTGACA AGGAAAGGTA 

Sail 

NCOl 5ml I Bell ECO RV 

18S1 GCGTCTTTTC TGCAGTCACC GTCGTCGACA CCTGTGATCA GATATCCCGG 
CCCAGAAAAG ACGTCAGTGG CAGCAGC7CT GCACACTAGT CTATAGCGCC 

Sphl Ecoav 

1901 CCGCTCTAGC TAGATGCATG CTCGAGCGGC CGCCAGTCTG ATGGATATCT 
GGCGAGATCG ATCTACGTAC GAGCTCGCCG GCGCTCACAC TACCTATAGA 

SJcoI 

1951 GCAGAATTCT ATCTTCAGGA TCTCGCCATG CAGGGTCTTA GCC7ACTCCA 

CGTCTTAAGA tacaagtcct agagcggtac ctcccagaat cggatgaggt 

2001 " ATTGCCCAGA GATAAATTTC GAAAAAGCTC TTTCTTTOTT TGGGTCATCA 
TAACGGGTCT C7ATT7AAAG CTTTTTCGAG AAAGAAACAA ACCCAGTAGT 

20*51 " TCTTATTTCA AAACGCCTTT TCCATGCCTT TGGGTGTTCT GACCAACAGC 
AJGAATAAAGT TTTCCGGAAA AGGTACGGAA ACCCACAACA CTGGTTGTCG 

5101 " ACTTTAGAAG TAACAGAGAT TGACCAGCTA GTCTGCAAGG ATCATCTTGC 
tcaaItcttc ATTGTCTCTA ACTCCTCGAT CAGACGTTCC -agtagaacg 

21S1 ATCAACTGAC CAGCTGAAAT CAGTTGGTCT CAACCTCGAG GGGAGCGGAG 
TAGTTGACTG GTCCACTTTA GTCAACCAGA GTTGGAGCTC CCCTCGCCTC 

ScoHV 

5501 TATCTACTGA TATCCCATCT GCGACAAAGC GTTGGGGCT7 CAGATCTGGT 

atIgatgak ATAGGGTAGA cgctgtttcg caaccccgaa GTCTAGACCA 

2251 GTCCCTCCCC AAGTGGTCAG CTATGAAGCA GSAGAATGGG CTGAAAAWG 
CACGGAGGGG TTCACCAGTC GATACTTCGT CCTCTTACCC GACTTTTAAC 

2301 CTACAATCTT GAAATAAAGA AACCCGACGG GAGCGAATGC TTACCCCCAC 
GATGTTAGAA CTTTATTTCT TTGGCCTCCC CTCGCTTACG AATCCGGGTG 

2351 CGCCGGATGG TGTCAGAGGC 7TTCCAAGGT CCCGCTATCT TCACAAAGCC 
GCGGCCTACC ACAGTCTCCG AAAGGTTCCA CGGCGATACA AGTGTTTCGG 

2401 CAAGGAACCG GGCCCTGCCC CGGTGACTA? GCCTTTCACA AGGATGGAGC 
GTTCCTTGGC CCGGGACGGG CCCACTCATA CGGAAAGTGT TCCTACCTCG 

24S1 TWCTTCCXC TATGACACCC TGGCTTCAAC TGTAATTTAC AGAGGAGTCA 
AAAGAAGGAG ATACTCTCCG ACCGAAGTTG ACATTAAA7C TCTCCTCAGT 
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2501 AXTCTGCTGA CCGGGTAATC GCATTCTTGA TATTGGCTAA ACCAAAGGAA 
TAAAACGACT CCCCCATTAG CGTAAGAACT ATAACCGATT TGGTTTCCTT 



2551 


ACGTTCCTTC 
TGCAAGGAAG 


AATCACCCCC 
TTACTGGGGG 


CATTCGAGAG 
GTAAGCTCTC 


GCAGCAAAC? 
CGTCGTMGA 


ACACTGAAAA 
TGTGACTTTT 


2601 


TACATCAAGT TACTATGCCA CATCCTACTT GGAGTACGAA ATCGAAAATT 
ATGTACTTCA ATGATACCGT GTAGCATGAA CCTCATGCTT TAGCTTTTAA 


2651 


TTGGTGCTCA ACACTCCACG 
AACCACCAGT TCTCAGGTCC 


ACCCTTTTCA AAAWAACAA 
TGGGAAAAGT TTTAATTGTT 


TAATACTTW 
ATTAXGAAAA 


2701 


CTTCTTCTGG ACACGCCCCA CACGCCTCAG 
CAAGAAGACC TGTCCGGGGT GTGCCGAGTC 


TTCCTTTTCC 
AAGGAAAAGG 


AGCTGAATGA 
TCGACTTACT 


2751 


TACCATTCAA 
ATCGTAAGTT 


CTTCACCAAC 
GAAGTGGTTG 


AGTTGAGCAA CACAACTGGG AAACTAATT? 
TCAACTCGTT GTGTTGACCC TTTGATTAAA 


2801 


GGACACTAGA 
CCTGTGATCT 


TGCTAATATC 
ACGATTATAG 


AATGCTGATA 
TTACGACTAT 


TTCGTGAATG 
AACCACTTAC 


GGCTTTTTGG 
CCGAAAAACC 


2B51 


GAAAATAAAA 
CTTTTATTTT 


AAATCTCTCC 
TTTACAGAGG 


GAACAACTAC 
CTTGTTGATG 


GTGGAGAAGA 
CACCTCTTCT 


GCTGTCTTTC 
CGACAGAAAG 


2301 


GAAACTTTAT 
CT7TCAAATA 


CGCTCAACGA 
GCGAGTTGCT 


GACAGAAGAC 
CTGTCTTCTG 


GATGATGCGA 
CTACTACGCT 


CATCGTCGAG 
GTAGCAGCTC 


2951 


AACTACAAAG 
TTGATGTTTC 


GGAAGAATCT 
CCTTCTTAGA 


CCGACCGGGC 
GGCTGGCCCG 


CACCAGCAAG 
G7GGTCCTTC 


TATTCGGACC 
AEAAGCCTGG 


3001 


TGGTTCCAAA 
ACCAACGTTT 


GGATTCCCCT GGGAXGGTTT 
CCTAAGGGGA CCCTACCAAA 


CATTGCACGT 
GTAACGTGCA 


ACCAGAAGGG 
TCGTCTTCCC 


3051 


GAAACAACAT 
CrTTGTTGTA 


TGCCGTCTCA 
ACCGCAGAGT 


CAATTCGACA 
CTTAAGCTGT 


GAAGGTCGAA 
CTTCCAGCTT 


GAG7ACATGT 
CTCATCTACA 


3101 


GAATACTCAG 
CTTATGAGTC 


GAAACTATCA 
CTTCGATAGT 


CAGAGACAAC 
GTCTCTGTTG 


TGCAACAATC 
ACGTTGTTAG 


A7AGGCACTA 
TATCCGTCAT 


3151 


ACGGTAACAA 
TGCCATTGTT 


CATGCAGATC 
GTACGTCTAG 


TCCACCATCG 
AGGTCCTAGC 


GGACAGGACT 
CCTGTCCTGA 


GAGCTCCACC 
CTCGAGCTCG 


3201 


CAAATCCTGA 
GTTTAGGACT 


GTTCCTCACC 
CAAGGAGTGG 


Ncol 

GACCATGCCA 
CTGGTACCGT 


CCAACCCCTG 
GGTTCGGGAC 


AGACTCAGAC 
TCTGAGTCTG 


3251 


CTCCACAACC 
GAGCTGTTGG 


TACACACCAA 
ATGTGTCGTT 


AACTACCAGT GATOACCACC 
7TGATGGTCA CTACTGGTCG 


GAGGAACCAA 
CTCCTTGGTT 


3301 


CAACACCACC 
GT7GTGGTGG 


GAGAAACTCT 
CTCTTTGAGA 


CCTGGCTCAA 
GGACCGAGTT 


CAACAGAAGC 
GWCTCTTCG 


ACCCACTCTC 
TGGGTGAGAG 



3351 ACCACCCCAG AGAATATAAC AACAGCGGTT AAAACTGTTT GGGCACAAGA 
TCCTGGGGTC TCTTATATTG TTCTCGCCAA TTTTGACAAA CCCGTGTTCT 
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3401 


GTCCACAAGC 
CAGGTGTTCG 


AACGGTCTAA 
TTGCCAGATT 


TAACTTCAAC AGTAACAGGT ATTCTTGCCA 
ATTGAAGTTG TCATTGTCCA TAAGAACCCT 




3451 


GCCTTGGACT 7CQAAAACGC 
CGGAACCMA. AGCTTTTGCG 


AGCAGAAGAC AAGTTAACAC CAGGGCCACG 
TCCTCTTCTG OTCAATTGTG G7CCCGGTGC 




* 3501 


GGTAAATGCA 
CCATTTACGT 


ATCCCAACTT 
TAGGGTTGAA 


ACACTACTGG ACTGCACAAG AACAACATAA 
TGTGATGXCC TGACGTGTTC TTGTTGTATT 








BaiSHI 




3551 


TCCTGCTGGG ATTGCCTGGA TCCCCTACTT TCGACCGGGT GCAGAAGGCA 
ACGACGACCC TAACCGACCT AGGGCATGAA ACCTGGCCCA CGTCTTCCGT 




3601 


ATATGTGACT 


TCCGGAATAC 


LALAAiLLAAA A«V*LL11av1 wiw4uUKU«V 

GTGTTGGTTT TACGGAATCA GACACCTGAG 




3651 


AGACAACTTG 
TCTGTTGAAC 


CAAATGAAAC 
GTTTACTTTG 


AACTCAAGCT CTGCAGCTTT TCTTAAGGGC 
OTGAGTTCGA GACGTCGAAA AGAAT^CCCG 




3701 


CACGACGGAG 
GTGC^GCCTC 


CTGCCGACAT 
GACGCCTGTA 


ATACCATACT CAA-AGGAAG GCCATAGATT 
TAvGGTATGA GTTATCCTTC CGGTATCTAA 










BaxoKI 




3751 


TCCTTCTGCC 
AGGAAGACCC 


ACGATGGGCC 
TGCTACCCCG 


GGGACATGTA GGATCCTGGG ACCAGATTGT 
CCCTGTACAT CCTAGGACCC TCGTCTAACA 




3801 


VGCATTGAGC 
ACGTAACTCG 


CACATUATTG 
GTGTACTAAC 


rtrc*&m%ftp ft«fi^i^TftiT*a illTTlirn 
uALUUiAAAU ATUaUIuAIA WUiiuUU.W\ 

CTGGTTTTTC TAGTGACTAT TTTAGTTCGT 




3851 


AATCATCCAT 
TTAGTAGGTA 


GATTTCATCG 
CTAAAGTAGC 


ACAACCCTTT ACCCAATCAG GATAATCATG 
TGTTGGGAAA TGGGTTAGTC CTATTACTAC 










BanHI 




3901 


A7AATTGGTG 
TATTAACCAC 


GACGGGCTGG 
CTGCCCGACC 


AGACAGTGGA TCCCTGCAGG AATAGGCATT 
TCTGTCACCT AGGGACGTCC TTATCCCTAA 




3551 


ACTGGAATTA 
TGACCT7AAT 


TTATTCCAAT 
AACAACGTTA 


CATTGCTCTT CTTTGCGTCT GCAAGCTCCT 
GTAACGAGAA GAAACCCAGA CGTTCGACGA 










BamKX 




4001 


CTGTTGAATA 
AACAACTTAT 


TCAGAATTCC 
AGTCTTAACG 


AGCACTGGCG GCCGTTACTA GTGGATCCGA 
TCGTGACCGC CGGCAATGAT CACCTAGGCT 










Narl 






SemHX 


XbaZ 


KasI EaxnHX 




4051 


GCTCGGATCC 
CGAGCCTAGG 


AAGCTCTAGA 
TTCGAGATCT 


CCAGGCCCCT GGATCCAGAT CTGCTGTGCC 
GGTCCGCGGA CCTAGGTCTA GACGACACCG 




4101 


TTCTAGT2GC 
AAGATCAACC 


CAGCCATCTG 
GTCGGTACAC 


TTGTTTGCCC CTCCCCCGTG CCTTCCTTCA 
AACAAACGGG GAGGGGGCAC GGAAGGAACT 





4151 CCCTGGAAGG TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGACGAAATT 
GGGACCTTCC ACGGTGAGCG TGACAGGAAA GGATTATTTT ACTCCTTTAA 
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4201 GCATCGCATT GTCTGAGTAG GTGTCAOTCT ATTCTGGGGG GTGGGGTGGG 
CGTAGCGTAA CAGACtCATC CACAGTAAGA TAAGACCCCC CACCCCACCC 



Spill 



4251 GCAGCACAGC AAGGGGGAGG ATTGGGAAGA CAATAGCAGG CATGCTGGGG 
CGTCGTGTCG TTCCCCCTCC TAACCCTTCT GTTATCGTCC GTACGACCCC 



xpnr 



43 01 ATGCGGTGGG CTCTATGGGT ACCCAGGTGC TGAAGAATTG ACCCGGTTCC 
TACGCCACCC GAGATACCCA TGGGTCCACC ACTTCTTAAC TGGGCCAAGG 



4351 TCC7GGGCCA GAAAGAAGCA GGCACATCCC CTTCTCTGTG ACACACCCTG 
AGGACCCCGT CTTTCTTCGT CCGTGTAGGG GAAGAGACAC TGTGTGGGAC 



4401 TCCACGCCCC TGGTTCTTAG TTCCAGCCCC ACTCATAGGA CACTCATAGC 
AGGTGCGGGG ACCAAGAATC AAGGTCGGCG TGAGTATCCT GTGAG7ATCG 



44 SI TCAGGAGGGC TCCGCCTTCA ATCCCACCCG CTAAAGTACT TGGAGCGGTC 
AGTCCTCCCG AGCCGGAAGT TAGGGTGCGC GATTTCATGA ACCTCGCCAG 



4501 TCTCCCTCCC TCATCAGCCC ACCAAACCAA ACCTAGCCTC CAAGAGTGGG 
AGAGGGAGGG AG7AGTCCGG TGGTTTGGTT TGGATCGGAG GTTCTCACCC 



4551 AAGAAATTAA AGCAAGATAG GCTATTAAGT GCAGAGGGAG AGAAAATGCC 
TTCTTTAATT TCGTXCTATC CGATAATTCA CGTCTCCCTC TCTTTTACGG 



xroni 



4601 TCCAACATCT GAGGAAGTAA TCAGAGAAA? CATAGAATTT CTTCCCCTTC 
AGGTTGTACA CTCCTTCATT ACTCTCTTTA GTATCTTAAA GAACGCGAAG 



4651 CTCGCTCACT GACTCGCTGC GCTCGGTCCT TCGGCTGCGG CGAGCGGTAT 
GAGCGAGTGA CTGAGCGACG CGAGCCAGCA AGCCGACGCC GCTCGCCATA 



4701 CAGCrXACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC 
GTCGAGTGAG TTTCCGCCAT TATGCCAATA GGTGTCTTAG TCCCCTATTG 



4751 GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA GGAACCGTAA 
CGTCCTTTCT TGTACACTCG TTTTCCGGTC GTCTTCCGGT CCTTGGCATT 



4801 AAAGGCCGCG TTCCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 
TTTCCGGCGC AACGACCGCA AAAAGGTATC CGAGGCGGGG GGACTGCTCG 



4851 ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACACGACTA 
TAGTGTTTW AGCTGCGAGT TCAGTCTCCA CCGCTTTGCG CTGTCCTGAT 



4901 TAAAGATACC AGGCGTTTCC CCCTGGAACC TCCCTCGTGC GCTCTCCTGT 
ATTTCTATGC TCCGCAAAGG 6GGACCTTCG AGGGAGCACG CGAGAGGACA 



4951 TCCGACCCTG CCGCTTACCG GATACCTCTC CGCCTTTCTC CCTTCGGGAA 
AGGCTGGGAC GGCGAAtGGC CTATCGACAG GCCGAAAGAG GGAAGCCCTT 



5001 GCGTGGCGCT TTCTCAATCC TCACCCTGTA GGTATCTCAG TTCGGTGTAG 
CGCACCGCGA AAGAGTTACG AGTGCGACAT CCATAGACTC AAGCCACATC 
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5051 


GTCGTTCGCT 
CAGCAAGCGA 


CCAAOCTGGG CTCTGTGCAC 
GGTTCGACCC GACACACGTG 


CAACCCCCCG 
CTTGGGCCGC 


TTCAGCCCGA 
AAGTCGGCCT 


5101 


CCGCTGCCCC 
GGCGACGCGG 


TTATCCGG7A ACTATCG7CT TGAGTCCAAC 
AATAGGCCAT TGATAGCACA ACTCAGCTTG 


CCGGTAAGAC 
GGCCATTCXG 


5151 


ACGACOTATC GCCACTGGCA GCAGCCACTC 
TGCTGAATAC CGGTGACCGT CGTCGGTGAC 


GTAACAGCAT 
CATTGTCCTA 


TAGCAGAGCG 
ATCGTCTCGC 


5201 


AGCTATGTAG 
TCCATACATC 


GCGGTGCTAC AGAGTTCTTG 
CGCCACGATG TCTCAAGAAC 


AAGTGGTGGC 
TTCACCACCG 


CTAACTACGG 
GATTGATGCC 


5251 


CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTCCTG AAGCCACTTA 
GATCTGATCT TCCTGTCATA AACCATAGAC GCGAGACGAC TTCGGTCAAT 


5301 


CCTTCCGAAA AACAGTTGG? AGCTCTTCAT 
GGAAGCCTT? TTCTCAACCA TCGAGAACTA 


CCGGCAAACA aaccaccgct 
GGCCCTTTGT TTCGTGGCGA 


5351 


GGTAGCGGTG 
CCATCGCCAC 


GTTTTTTTGT TTGCAAGCAG 
CAAAAAAACA AACGTTCGTC 


CAGATTACGC 
CTCTAATGCC 


GCAGAAAAAA 
CGTCTTTTTT 


5401 


AGGATCTCAA 
TCCTAGAGTT 


GAAGATCCTT TGATCTTTTC 
CTTCTAGGAA ACTAGAAAAG 


TACGGGGTCT CACGCTCAGT 
ATGCCCCAGA CTGCGAGTCA 


5451 


GGAACGAAAA 
CCTTGCTTTT 


CTCACGTTAA GGGATTTTGG 
GAGTCCAATT CCCTAAAACC 


TCATGAGATT ATCAAAAAGG 
AGTACTCTAA TAGTT7TTCC 


5501 


ATCTTCACCT AGATCCTTTT AAATTAAAAA TCAAGTTTTA AATCAATCTA 
TAGAACTGGA TCTAGGAAAA TTTAATTTTT ACTTCAAAAT TTAGTCAGAT 


5551 


AAGTATATAT 
TTCATATATA 


GAGTAAACTT GCTCTGACAG 
CTCATTTGAA CCAGACTCTC 


TTACCAATGC TTAATCAGTG 
AATGGTTACG AATTAGTCAC 


S601 


AGGCACCTAT 
TCCCTGGATA 


CTCAGCGATC TGTCTATTTC 
GAGTCGCTAG ACAGATAAAG 


GTTCATCCAT 
CAAG7AGGTA 


AGTTGCCTGA 
TCAACGGACT 


5651 


CTCCCGGGGG 
CAGGCCCCCC 


CCCGGGCCCT GAGGTCTGCC TCGTGAAGAA 
CCCCCCGCGA CTCCAGACGG AGCACTTCTT 


GCTGTTGCTG 
CCACAACGAC 


5701 


ACTCATACCA 
TGAGTATGGT 


GGCCTGAATC GCCCCATCAT 
CCCGACTTAG CGGGGTAGTA 


CCAGCCAGAA AGTGAGGGAG 
GGTCGGTCTT TCACTCCCTC 


5751 


CCACGGTTGA 
GGTGCCAACT 


TGAGAGCTTT GTTGTAGGTG 
ACTCTCGAAA CAACATCCAC 


GACCAGTTGG 
CTCCTCAACC 


7GATTTTGAA 
ACTAAAACTT 


5801 


CTTTTGCTTT GCCACGGAAC GGTCTCCGTT 
GAAAACCAAA CGGTGCCTTG CCAGACGCAA 


GTCGCGAAGA 
CAGCCCTTCT 


TCCCTGATCT 
ACGCACTAGA 


5851 


GATCCTTCAA 
CTAGGAAGTT 


CTCAGCAAAA GTTCGATTTA 


TTCAACAAAG 


CCGCCCTCCC 
GCCGGGAGGG 


GAGTCGTTTT CAAGCTAAAT 


AACTTGTTTC 



5901 GTCAAGTCAC CGTAATGCTC TGCCAGTGTT ACAACCAATT AACCAATTCT 
CAGTTCAGTC GCATTACGAG ACGGTCACAA TGTTGGTTAA TTGGTTAAGA 



5951 GATTAGAAAA ACTCATCGAG CATCAAATGA AACTGCAATT TATTCATATC 
CTAATCTTTT TGAGTAGCTC GTAGTTTACT TTGACGTTAA ATAAGTATAG 
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6001 AGGATTATCA ATACCATATT TTTCAAAAAC CCGTTTCTGT AATG^OAG 
TCCTAATAGT TATOGTATAA AAACTTTTTC GGCAAAQACA TTACTTCCTC 



6051 AAAACTCACC CAOOCAGTTC CATAGCATGG CAAGATCCTO GTATCGGTCT 
TTTTGAGTGG CTCCGTCAAG CTATCCTACC CTTCTACOAC CATAGCCAGA 



6101 GCGAMCCGA CTCGTCCAAC ATCAATACAA CCTATTAATT TCCCCTCCTC 
CGCTAAGGCr GAGCAGCTTG UUGTOATGTT CGATAATTAA AGGGGACCAG 



6151 AAAAATAAGG TTATCAAGTG AGAAATCACC ATCAGTGACG ACTGAATCCG 
TTTTTATTCC AATAGTTCAC TCTCTAGTGG TACTCACTGC TGACTTAGGC 



HlndZll 



6201 GTGAGAATGG CAAAAGCTTA TGCATTTCTT TCCAGACTTG TT CAAC AGGC 
CACTCTTACC GTTTTCGAAT ACGTAAAGAA AGGTCTGAAC AAGTTGTCCG 



6251 CAGCCATTAC CCTCGTCATC AAAATCACTC GCATCAACCA AACCGTTATT 
GTCGCTAATG CGAGCAGTAG TTTTAGTGAG CGTAGTTGG? TTGGCAATAA 



Pvul 



6301 CATTCCTGAT TGCGCCTGAC CGAGACCAAA TACCCGATCG CTCCT AAAAG 
GTAAGCACCA ACGCGGACTC GCTCTGCTTT ATGCGCTAGC GACAAT7CTC 



6351 GACAATTACA AACAGGAATC GAATGCAACC GGCGCAGGAA CACTGCCAGC 
CTCTTAATGT TTGTCCTTAG CTXACGTTGG CCGCGTCCTT GTGACGGTCG 



6401 GCATCAACAA TATTTTCACC TGAATCAGGA TXCTCTTCTA ATACCTGGAA 
CGTAGTTGTT ATAAAAGTGG ACCTACTCCT ATAAGAAGAT TATGGACCTC 



6451 TGCTGTTTTC CCCGGGATCG CAG7GGTCAG TAACCATGCA TCATCAGGAG 
ACGACAAAAG GGCCCCTAGC GTCACCACTC A7TGGTACGT AGTAGTCCTC 



6501 TACGGATAAA ATCCTTGATG GTCGGAAGAG GCATAAATTC CGTCAGCCAG 
ATCCCTATTT TACGAACTAC CAGCCTTCTC CGTATTTAAG GCAGTCGGTC 



6551 7TTACTCTGA CCATCTCATC TGTAACATCA TTGGCAACGC TACCTTTGCC 
AAATCAGACT GGTAGAGTAG ACATTGTAGT AACCCTTGCG ATGCAAACGG 



Clal 

6601 ATCTTTCAGA AACAACTCTG GCGCATCGGG CTTCCCATAC AATCGATAGA 
TACAAAGTCT TTGTTGAGAC CCCGTAGCCC GAAGGGTATG TTAGCTATCT 



6651 T7GTCGCACC TGATTGCCCC ACATTATCCC GAGCCCATTT ATACCCATAT 
AACAGCCTGG ACTAACGGGC TGTAATAGCC CTCGGGTAAA TATGGCTATA 



6701 AAATCAGCAT CCATGTTGGA ATTTAATCGC GGCCTCGAGC AAGACGTTTC 
TTTAGTCGTA GGTAGAACCT TAAATTAGCG CCGGAGCTCG TTCTGCAAAG 



6751 CCGTTGAATA TGGCTCATAA CACCCCTTGT ATTACTGTTT ATGTAAGCAG 
CGCAACTTAC ACCGAGTATT CTGGGGAACA TAATGACAAA TACATTCGTC 



6801 ACAGTTTTAT TGTTCATGAT GATATATTTT TATCTTGTGC AATGTAACAT 
TGTCAAAATA ACAAGTACXA CTATATAAAA ATAGAAGACG TTACATTGTA 



o l 9 gs :l gi o gi 1)f o s :ii 7 01 ± 

WO 99/32147 PCTAJS98/27364 

22 



Dralll 



£851 CAGAGATTTT GAGACACAAC GTGGCTTTCC CCCCCCCCCC ATTATTGAAG 
CTCTCTAAAA CTCTGTGTTC CACCCAAAGG GGGGGGGGGG TAATAACTTC 



€901 CATMArPCAG GGTTATTGTC TCATCAGCGG ATACATATTT GAATGTATTT 
GTAAATACTC CCAATAACAG AGTACTCGCC TATGTATAAA CTTACATAAA 



6951 AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA 
TCTTTTTATT TGTTTATCCC CAAGGCGCGT GTAAAGGGGC TTTTCACGGT 



7001 CCTGACGTCT AAGAAACCAT TATTATCATG ACATTAACCT ATAAAAATAG 
GCACTGCAGA TTCTTTGGTA ATAATAGTAC TGTAATTGGA TATTTTTATC 



7051 GCGTATCACG AGGCCCTTTC GTC 
CGCATAGTGC TCCGGGAAAG CAG 
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pVR 1012-GP(Z) -t— w \r , ^ 

General Description 7r>edU«*^CX^ v 

DNA pVR 1012 -GP<Z) U 
Local object 

Creatadx 09/15/98 05:06PM 
Laat Modification Date: ? (no data) 
length: 7285 bp 
storage type: Basic 
form: Circular 
Conenents 

Restriction Map 

Dralll: 1 site g*S£*g 
HlndHI: 1 site **gg» 
Hpal:1site grjgc 
Kast:lsite gggjg 
Nar!:1site ggggg 
Notl:1site ligg^gl 
Pmll: 1 site §*g»<j 
Pvuinsite 
SacllMsite iit^g 
Xbal:lsite ™™ 
Xholrlsite 

EcoRV:2sites {gjggg 
NcoI:2sites gggg 
Ndel: 2 sites g*g$"s 
Sphl:2sites 

Functional Map 

COS (4 signals) 
CMV IE 5* UT 

StBrtt 886 End: 1129 
CMV IE INT 

Start: 1130 2nd: I860 
TbGH 

Start: 4302 End: 48S4 

Kan r 

Start: 6350 End: 6972 (Canplercantary) 
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Misc_f ature (2 signals) 
CMV enhancer 

Start: 248 End: 885 
GP(Z) 

Start* 1870 End: 4301 

Annotations 
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1 


TCGCGCGTTT 
ACCGCGCAAA 


CGGTGATGAC 
GCCACTACTG 


GGTGAAAACC 
CCACTTTTGG 


TCTGACAGAT 
AGACTGTGTA 


GCAGCTCCCG 
CGTCGAOGGC 


51 


GAGACGGTCA 
CTCTGCCAGT 


CAGCTTGTCT 
GTCGAACAGA 


GTAACCGGAT 
CATTCGCC7A 


GCCGGGAGCA 
CGGCCCTCGT 


GACAAGCCCG 
CTGTTCGGGC 


101 


TCAGGGCGCC 
AGTCCCGCGC 


7CAGCCGGTG 
AGTCGCCCAC 


TTGGCGGGTG TCGGGGCTCG 
AACCGCCCAC AGCCCCCACC 


CTTAACTATG 
GAATTGATAC 










NdftI 




151 


CGGCATCAGA 
CCCGTAGTCT 


GCAGATTGTA 
CGTCTAACAT 


CTGAGAGTGC 
GACTCTCACG 


ACCATATGCG 
TGGTATACGC 


GTGTGAAATA 
CACACTTTAT 


201 


CCGCACAGAT GCGTAAGGAG 
GGCGTCTCTA CCCATTCCTC 


AAAATACCGC 
TTTTATGCCG 


ATCAGATTGG 
TAGXCTAACC 


CTATTGGCCA 
GATAACCGCT 


251 


TTGCATACGT TGTATCCATA 
AACGTATGCA ACATAGGTAT 


TCATAATATG 
AGTATTATAC 


TACATTTATA 
ATGTAAATAT 


TTGGCTCATG 
AACCGAGTAC 


301 


TCCAACATTA 
AGGTTGTAAT 


CCGCCATGTT 
CGCGGTACAA 


CACATTGAT7 
CTGTAACTAA 


ATTGACTAGT TATTAATAGT 
7AACTGATCA ATAA77ATCA 


351 


AATCAATTAC 
TTAGTTAATG 


GGGGTCATTA 
CCCCAGTAAT 


GTTCATAGCC 
CAAGTA7CGG 


CATATATGGA 
CTATATACCT 


GTTCCGCCTT 
CAAGGCGCAA 


401 


ACATAACTTA 
TGTATTGAAT 


CGGTAAATGG 
CCCATTTACC 


CCCGCCTGGC 
GGGCGGACCG 


TGACCGCCCA ACGACCCCCG 
ACTGGCGGGT TGCTCGGGGC 


451 


CCCATTCACG 
GGGTAACTGC 


TCAATAAXGA 
AGTTATTACT 


CGTATCTTCC 
GCATACAAGG 


CATAG7AACG 
GTATCAMGC 


CCAA7AGGGA 
GGTTATCCCT 


501 


C7TTCCATTC 
CAAAGGTAAC 


ACGTCAATGG 
TGCAGTTACC 


GTCGAG7ATT 
CACCTCATAA 


TACGGTAAAC 
ATGCCATTTG 


7GCCCACTTC 
ACGGGTGAAC 


551 


GCAGTACA7C 
CG7CATGTAG 


Ndel 

AAGTGTATCA TATGCCAAGT 
TTCACATAGT ATACGGTTCA 


ACGCCCCCTA 
TGCGGCCGAT 


T7GACGTCAA 
AACTGCAGTT 


601 


TGACGGTAAA 
ACTGCCATT7 


TGGCCCCCCT 
ACCGGGCGGA 


GGCAT7ATGC 
CCGTAATACG 


CCAGTACATG 
GGTCATG7AC 


ACCTTATGGG 
TGGAATACCC 












Ncol 


651 


ACTTTCCTAC 
TGAAAGGATG 


TTGGCAGTAC 
AACCGTCATG 


ATCTACGTAT TAGTCA7CGC TATTACCATG 
TAGATGCATA ATCAGTAGCG ATAATGGTAC 




NeoX 










701 


GTCATGCCGT 
CACTACGCCA 


TTTGGCAGTA 
AAACCGTCAT 


CATCAATGGG 
GTAGTTACCC 


CG7CGATAGC 
GCACCTATCG 


CGTTTGACTC 
CCAAACTGAG 


751 


ACGCCGA7TT 
TCCCCC7AAA 


CCAAGTCTCC 
GGTTCAGAGG 


ACCCCATTGA 
TGGGGTAACT 


CGTCAATGGG 
GCAGTTACCC 


AGTTTGTTTT 
TCAAACAAAA 


801 


GGCACCAAAA 
CCGTGCTTTT 


TCAACGGGAC 
ACTTGCCCTG 


TTTCCAAAAT 
AAAGGTTTTA 


GTCGTAACAA 
CAGCATTGTT 


CTCCGCCCCA 
GAGGCGGGCT 
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831 TTGACGCAAA TGGGCCOTAG GCCTGTACGG TCGGAGGTCT ATATAAGCAG 
AAC7GCGTTT ACCCGCCATC CGCACATGCC ACCCTCCAGA TATATTCGTC 



901 AGCTCCTTTA GTCAACCGTC AGATCGCCTO GAOACCCCAT CCACCCTGTT 
TCGACCAAAT CACTTGGCAG TCTAGCCGAC CTCTCCGGTA GGTGCGACAA 



sac 1 1 



951 TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG CGGCCGGGAA 
AACTGGACGT ATCTTCTGTG CCCCTGGCTA GGTCGGAGGC GCCGGCCCTT 



1001 CGGTGCATTG GAACGCGGAT TCCCCGTGCC AAGAGTCACG TAAGTACCGC 
GOCACCTAAC CTTGCGCCTA AGGGGCACGG TTCTCACTGC ATTCATGGCG 



sphi 

1051 CTATAGACTC TATAGGCACA CCCCTTTCGC TCTTATGCAT GCTATACTGT 
GA2A7CTGAG ATATCCCTGT GGGGAAACCG AGAATACGTA CGATATGACA 



1101 TTTTGGCTTG GGGCCTATAC ACCCCCGCTT CCTTATGCTA TAGGTGATGC 
AAAACCGAAC CCCGGATATG TGGGGGCGAA GGAATACGAT ATCCACTACC 



1151 TATAGCTTAG CCTATAGGTG TGGGTTATTG ACCATTATTG ACCACTCCCC 
ATATCGAATC GGATATCCAC ACCCAATAAC TGGTAATAAC TGGTGACGGG 



-201 TATTGGTGAC GATACTTTCC ATTACTAA7C CATAACASGG CTCTTTGCCA 
ATAACCACTG CTATGAAAGG TAATGATTAG GTATTGTACC GAGAAACGGT 



1251 CAACTATCTC TATTGGCTAT ATGCCAATAC TCTGTCCTTC ACAGACTGAC 
GTTGATAGAG ATAACCGATA TACGGTTATG AGACAGGAAG TCTC7GACTC 



130* ACGGACTCTG TATTTTTACA GCATGGGGTC CCATTTATTA TTTACAAATT 
fTGCCTGAGAC ATAAAAATGT CCTACCCCAG GG7AAATAAT AAATGTTTAA 



1351 CACATATACA ACAACGCCGT CCCCCGTGCC CGCAGTTTTT ATTAAACATA 
GTCTATATGT TGTTGCGGCA GGGGGCACGG CCGTCAAAAA TAATTTGTAT 



1401 COOTGGGATC TCCACGCGAA TCTCGGCTAC GTGTTCCGGA CATGGGCTCT 
CGCACCCTAG AGGTGCGCTT AGAGCCCATG CACAAGGCCT GTACCCGAGA 



1451 "CTCCGGTAG CGGCGGAGCT TCCACATCCG AGCCCTGGTC CCATGCCTCC 
AGAGGCCATC CCCGCCTCGA AGGTGTAGGC TCGGGACCAG GGTACGGAGG 



1501 AGCGCCTCAT CCTCGCTCCG C AGCTCCTTG CTCCTAACAG TCGACCCCAG 
TCGCCGAGTA CCAGCGAGCC GTCGAGGAAC GAGGATTCTC AOCTCCGGTC 



1551 ACTTAGGCAC AGCACAATGC CCACCACCAC CAGTGTGCCG CACAAGGCCG 
TGAATCCOTG TCGTGTTACG CGTGGTGGTG GTCACACCGC GTGTTCCGGC 



1601 TGGCGGTAGG GTATGTCTCT GAAAATGAGC GTGGACATTG GCCTCGCACG 
ACCCCCATCC CATACACAGA CTTTTACTCG CACCTCTAAC CCGAGCGTGC 



1^51 GCTGACGCAG ATGGAAGACT TAAGGCAGGG CCAGAAGAAG ATGCAGGCAG 
CGACTGCGTC TACCTTCTGA ATTCCGTCCC CGTCTTCTTC TACGTCCGTC 



1701 CTGAGTTGTT GTATTCTOAT AAGAGTCAGA GGTAACTCCC GTTGCGGTGC 
GAC7CAACAA CATAAGACTA TTCTCAGTCT CCATTGAGGG CAACGCCACC 
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1751 


Hp&X 

TGTTAACGGT 
ACAATTGCCA 


GGAGGGCAGT 
CCTCCCGTCA 


GTAGTCTGAG 
CATCAGACTC 


CAGTACTCGT TCCTGCCGCG 
CTCATGAGCA ACGACGGCGC 














NCOl 




1801 


CGCGCCACCA 
GCCCGGTGGT 


GACATAATAG 
CTGTATTATC 


CTGACAGACT 
GACTGTCTGA 


AAGAGACTGT 
TTGTCTGACA 


TCCT77CCAT 
AGGAAAGGTA 






NCOl 




Pmll 


EcoHVNOtl 




1851 


GGGTCTTTTC 
CCCAGAAAAG 


TGCAGTCACC 
ACGTCAGTGG 


G7CCTCGACA 
CAGCAGCTGT 


CCTGTGATCA 
CCACACTAGT 


GATATCGCGG 
CTATAGCGCC 








Narl 












NotI XbaX 


Kasx 










1901 


CCGCTCTAGA 
GGCGAGATCT 


CCAGGCGCCT 
GGTCCGCGGA 


GGATCGATCC 
CCTAGCTAGG 


GCGATGAAGA TTAAGCCGAC 
CGCTACTTCT AATTCGGCTG 




1951 


AGTCAGCGTA 
TCACTCGCAT 


ATCTTCATCT 
TAGAAGTAGA 


CTCTTAGATT 
GAGAATCTAA 


ATTTGTTTTC 
TAAACAAAAG 


GAGAGTAGGG 
GTCTCATCCC 




2001 


GTCGTCAGGT 
CAGCAGTCCA 


CCTTTTCAAT CGTGTAACCA 
GGAAAAGTTA GCACATTGGT 


AAATAAACTC 
TT7ATTTGAG 


CACTAGAAGG 
G7CATCTTCC 




2051 


ATATTGTGGG 
XATAACACCC 


GCAACAACAC 
CG7TGTTGTG 


AATGGGCGTT 
T7ACCCGCAA 


ACAGGAATAT 
TGTCCTTATA 


TGCAGT7ACC 
ACGTCAATGG 




2101 


TCGTGATCCA TTCAAGAGGA 
AGCAC7AGCT AAGTTCTCCT 


CATCAT7CTT 
GTAGTAAGAA 


TCTTTGGGTA 
AGAAACCCAT 


ATTATCCTTT 
TAATAGGAAA 




2151 


TCCAAAGAAC ATTTTCCATC 
AGGTTTCTTG TAAAAGGTAG 


CCACTTCGAG 
GGTGAACCTC 


TCATCCACAA 
ACTAGGTGTT 


TAGCACATTA 
ATCGTGTAA? 




2201 


CACCTTAGTG 
CTCCAATCAC 


ATGTCGACAA 
TACAGCTGTT 


ACTAOTTTGT 
TGATCAAACA 


CC7GACAAAC 
GCACTCTTTG 


TGTCATCCAC 
ACAGTAGGTG 




2251 


AAATCAATTG 
TTTAGTTAAC 


AGATCAGTTG 
TCTAGTCAAC 


GACTGAATCT 
CTGACTTAGA 


CGAACGGAAT 
GCTTCCCTTA 


GGAGTGGCAA 
CCTCACCGTT 




2301 


CTGACGTGCC 
GACTGCACGG 


ATCTGCAACT 
TAGACGTTGA 


AAAACATGGG 
CTTTCTACCC 


GCTTCAGGTC 
CGAAGTCCAG 


CGGTGTCCCA 
GCCACAGGGT 




2351 


CCAAAGGTCG TCAATTATGA AGCTGGTGAA TGGGCTGAAA ACTGCTACAA 
GGTTTCCACC AGTTAATACT TCGACCACTT ACCCGACTTT TGACGATGTT 




2401 


TCTTGAAATC 
AGAACTTTAG 


AAAAAACCTG 
TTTTTTGGAC 


ACGGGAGTGA 
TGCCCTCACT 


GTCTCTACCA 
CACAGATCGT 


GCAGCGCCAG 
CGTCGCGGTC 




2651 


ACGGGATTCG 
TGCCCTAAGC 


GGGCTTCCCC 
CCCGAACGGG 


CGGTGCCCGT 
GCCACGGCCA 


ATGTCCACAA 
TACACGTGTT 


AGTATCAGGA 
TCATAGTCCT 




2501 


ACGGGACCGT 
TGCCCTGGCA 


GTGCCGGAGA 
CACGGCCTCT 


CTTTGCCTTC 
GAAACGCAAG 


CATAAAGAGG 
GTATTTCTCC 


GTGCT7TCTT 
CACGAAAGAA 
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25S1 CCTGTATCAT CGACTTGCTT CCACAGTTAT CTACCGAGGA ACGACTTTCC 
GGACATACTA GCTGAACGAA CCTOTCAATA GATGGCTCCT TGCTGAAAGC 



2601 CTCAAGGTGT CGTTGCATTT CTGATACTGC CCCAAGCTAA CAAGGACTTC 
CACTTCCACA GCAACCTAAA 6ACTATGACG GGGTTCGATT CTTCCTCAAG 



2651 TTCAGCTCAC ACCCCTTGAG A6ACCCGGTC AATGCAACGG AGGACCCCTC 
AAGTCGAGTG TCCCGAACTC TCTCGGCCAG TTACGTTGCC TCCTCGGCAG 



ECORV 



2701 TAGTGGCTAC TATTCTACCA CAATTAGATA TCAGGCTACC GGTTTTGGAA 
ATCACCGATG ATAAGATGGT GTTAATCTAT AGTCCGATCG CCAAAACCTT 



27S1 CCAATGAGAC AGAGXACTTG TTCGAGGTTG ACAATTTGAC CTACGTCCAA 
GGTTACTCTG TCTCATGAAC AAGCTCCAAC TGTTAAACTG GATGCAGGTT 



2801 CTTGAATCAA GATTCACACC ACAGTTTCTC CTCCACCTGA ATGAGACAAT 
GAACTTAGTT CTAAGTGTGG TGTCAAAGAC GAGCTCGACT TACTCTGTTA 



2851 ATATACAAGT CGGAAAAGCA GCAA7ACCAC GGGAAAACTA ATTTGGAAGC 
TATATGTTCA CCC7TTTCCT CGTTATGGTG CCCTTT7GAT TAAACCTTCC 



2901 TCAACCCCGA AATTGATACA ACAATCGGGG AGTGGGCCTT CTGGGAAACT 
ACTTGGGGCT TTAACTATGT TGTTAGCCCC TCACCCGGAA GACCCTTTGA 



2951 AAAAAAAACC TCACTAGAAA AAMCGCAG7 GAAGAGTTGT CTTTCACAGT 
TTTOTTTTGG AGTGATCTTT TTAAGCGTCA CTTCTCAACA GAAAGTGTCA 



3001 TGTAICAAAC GGAGCCAAAA ACATCAGTCG TCAGACTCCG GCCCGAACTT 
ACAXACTTTG CCTCGGT1TT TGTAGTCACC AGTCTCAGGC CGCGCT7GAA 



3051 CTTCCCACCC AGGGACCAAC ACAACAACTG AACACCACAA AATCATGGCT 
GAAGGCTGGG TCCCTCGTTG TGTTGTTGAC TTCTGGTGTT TTAGTACCGA 



3101 TCAGAAAATT CCTCTGCAA7 GGTTCAAGTG CACAGTCAAG GAAGC GAAG C 
AGTCTTTTAA GGAGACGTTA CCAAGTTCAC GTGTCAG7TC CTTCCCTTCG 



3151 TGCAGTGTCG CATCTAACAA CCCTTGCCAC AATCTCCACG AGTCCCCAAT 
ACGTCACAGC GTAGATTGTT GGGAACCGTG TTAGAGGTGC TCAGGGGTTA 



3201 CCCTCACAAC CAAACCAGGT CCGGACAACA GCACCCATAA TACACCCGTG 
GGGAGTGTTG GTTTCCTCCA GGCCTGTTCT CGTGGGTATT ATCTGGGCAC 



3251 TATAAACTTG ACATCTCTGA GCCAACTCAA GTTGAACAAC ATCACCGCAG 
ATATTTGAAC TGTAGAGACT CCGTTCAGTT CAACTTGTTG TAGTGGCCTC 



3301 AACAGACAAC GACAGCACAG CCTCCGACAC TCCCTCTGCC ACGACCGCAG 
TTGTCTGTTG CTCTCGTGTC GGAGGCTGTG AGGGAGACGG TGCTGGCGTC 



3351 CCGGACCCCC AAAAGCAGAG AACACCAACA CGAGCAAGAG CACTGACTTC 
GGCCTGOGGG TTTTCGTCTC TTCTGGTTGT CCTCGTTCTC GTGACTGAAG 



3401 CTGGACCCCG CCACCACAAC AAGTCCCCAA AACCACAGCG AGACCGCTGG 
GACCTGGCGC GGTGGTGTTG TTCAGGGGTT TTGGTGTCGC TCTGGCOAOC 
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3451 CAACAACAAC ACTCATCACC AAGATACCGG AGAACAGAGT GCCACCAOCG 
GTTGTTCTTG TGACTAGTGG TTCTATGGCC TCTTCTCTCA CGCTCGTCGC 



3501 GOAAGCTAGO CTTAATTACC AATACTATTG CTGOAGTCGC ACGACTGATC 
CCTTCGATCC CAATTAATGG TTATGATAAC GACCTCAGCG TCCTCACTAO 



3551 ACAGGCGGGA GAAGAACTCG AAGAGAAGCA ATTGTCAATG CTCAACCCAA 
TGTCCGCCCT CTTCTTGAGC TTCTCTTCGT TAACAGTTAC GAGTTGGGTT 



3601 ATGCAACCCT AATTTACATT ACTGGACTAC TCAGCATGAA GGTCCTGCAA 
TACGTTGGGA TTAAATCTAA TGACCTGATG AGTCCTACTT CCACGACGXT 



3651 TCOGACTGGC CTGGATACCA TATTTCGGGC CAGCAGCCGA GGGAATTTAC 
AGCCTGACCG GACCTATGGT ATAAAGCCCG CTCGTCGGCT CCCTTAAATC 



3701 ATAGAGGGCC TAATGCACAA TCAAGATGGT OTAATCTCTG GGTTGAGACA 
TATCTCOCCG ATOACGTGTT AGTTCTACCA AATTAGACAG CCAACTCTGT 



3751 GCTGGCCAAC GAGACGACTC AAGC2CTTCA ACTGTTCCTG AGAGCCACAA 
CGACCGGTTG C7CTGCTGAG TTCGAGAAGT TGACAAGGAC TCTCGGTGTT 



3801 CTGAGCTACG CACCTTTTCA ATCCTCAACC GTAAGGCAAT TGATTTCTTG 
GACTCGATGC GTCGAAAAGT TAGGAGTTGG CATTCCCTTA ACTAAAGAAC 



3851 CTGCAGCGAT GGGGCGGCAC ATGCCACATT CTGGGACCGG ACTGCTGTAT 
GACGTCGCTA CCCCGCCGTG 7ACGGTG7AA GACCCTGGCC TGACGACATA 



3901 CGAACCACAT GATTGGACCA AGAACATAAC AGACAAAAT? GATCAGATTA 
GC7TGGTCTA CTAACCTGGT TCTTGTATTG TCTGTTTTAA CTAGTCTAAT 



3951 TTCATGATTT TGTTGATAAA ACCCTTCCCG ACCAGGCGGA CAATGACAAT 
AAGTACTAAA ACAACTATTT TGGGAAGGCC TGCTCCCCCT GTTACTGTTA 



4001 TGGrCGACAG GATGGAGACA ATGGATACCG GCAGGTATTG GAGTTACAGG 
ACCACCTGTC CTACCTCTGT TACCTATGGC CGTCCATAAC CTCAATGTCC 



4051 CGTTATAATT GCACTTATCG CTTTATTCTG TATATG CAAA TTTGTCTTTT 
GCAATATTAA CGTCAATAGC GAAATAAGAC ATATACGTTT AAACAGAAAA 



4101 AGTTTTTCTT CAGATTGCTT CATCCAAAAG CTCAGCCTCA AATCAATGAA 
TCAAAAACAA GTCTAACGAA GTACCTTTTC GAGTCGCAGT TTAGTTACTT 



4151 ACCAGGATTT AATTATATGG ATTACTTGAA TCTAAGATTA CTTGACAAAT 
TGGTCCTAAA TTAATATACC TAATGAACTT AGATTCTAAT GAACTCTTTA 



4201 GATAATATAA TACACTGGAG CTTTAAACAT AGCCAATCTG ATTCTAACTC 
CTATTATATT ATGTGACCTC GAAATTTCTA TCGCTTACAC TAAGATTGAG 



4251 CTTTAAACTC ACAGTTAATC ATAAACAAGG TTMGTACCG AGCTCGAATT 
GAAATTTGAG TGTCAATTAG TATTTGTTCC AAACCATCGC TCGAGCTTAA 



4301 ATCTCCTGTG CCTTCTAGTT GCCAGCCATC TGTTCTTTGC CCCTCCCCCG 
TAGACGACAC GGAAGATCAA CGGTCGGTAG ACAACAAACC GGGAGGGGCC 



4351 TCCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT TTCCTAATAA 
ACGGAACGAA CTGGGACCTT CCACGGTGAG GGTGACAGGA AAGGATTATT 
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Pf T/l I^OR/77^Ad 


4401 


AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT C2ATTCTGGG 
TTACTCCTTT AACCTAGCGT AACAGACTCA TCCACAGTAA GATAAGACCC 




4451 


CGGTOGGGTO GGGCAGCACA GCAAGGGGGA CGATTGGGAA GACAATACCA 
CCCACCCCAC CCCCTCGTGT CGCTCCCCCT CCTAACCCTT CTCTTATCGT 






SphZ 




4501 


GCCATGCCCG GGMCCCGTO GGCTCTATGG GTACCCAGGT GCTGAAGAAT 
CCGTACGACC CCTACGCCAC CCGAGATACC CATCCCTCCA CGACTTCTTA 




4551 


TGACCCGGTT CCTCCT66GC CAGAAAGAAG CAGGCACATC CCCTTCTCTG 
ACTGGGCCAA GGAGGACCCG GTCTTTCTTC GTCCGTOTAG GGGAAGAGAC 




4601 


TGACACACCC TCTCCACGCC CCTGGTTCTT AGTTCCAGCC CCACTCATAG 
ACTCTGTGGG ACAGGTGCGG GGACCAAGAA TCAAGGTCGG GGTGAGTATC 




4551 


GACACTCATA GCTCAGGAGG GCTCCCCCTT CAATCCCACC CGCCAAAGTA 
CTGTGAG7AT CGAGTCCTCC CGACGCGGAA GTTAGGGTGG CCGATTTCAT 




4701 


CTTGGAGCGG TCTCTCCCTC CCTCATCAGC CCACCAAACC AAACCTAGCC 
GAACCTCGCC AGAGAGGGAG GGAGTAGTCC GGTGGTTTGG TTTGGATCGG 




4751 


TCCAAGAGTG GGAAGAAATT AAAGCAAGAT AGGCTATTAA GTGCACAGGG 
AGGTTCTCAC CCTTCTTTAA TTTCGTTCTA TCCGATAATT CACGTCTCCC 




4801 


AGAGAAAATG CCTCCAACAT CTGAGGAAGT AATGAGAGAA ATCATAGAAT 
TCTCTTTTAC GGAGGTTGTA CACTCCTTCA TTACTCTCTT TAGTATCTTA 




4651 


TTCTTCCGCT TCCTCGCTCA CTGACTCGC? GCGCTCGGTC GTTCGGCTGC 
AAGAAGGCGA AGGAGCGAGT GACTGAGCGA CGCGAGCCAG CAAGCCGACG 




4901 


GGCCAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT ATCCACAGAA 
CCGCTCCCCA TAGTCGAGTG AGTTTCCGCC ATTATCCCAA 7AGGTGTCTT 




4951 


TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGCCC AGCAAAAGGC 
AGTCCOCTAT TCCGTCCTTT CTOGTACACT CGTTTTCCGG TCGTTTTCCG 




5001 


CAGGAACCG7 AAAAAGGCCG CGTTGCTCGC GTTTTTCCAT AGGCTCCGCC 
fT/TTTYW A TTTTTCCCGC GCAACGACCG CAAAAAGGTA TCCGAGGCGG 




5051 


CCCCTGACGA GCATCACAAA AATCGACGCX CAAGTCAGAG GTGGCCAAAC 
GGGGACTGCT CGTAGTGTTT TTAGCTGCGA GTTCAGTCTC CACCGCTTTC 





5101 CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT 
GGCTGTCCTG ATATTTCTAT GGTCCGCAAA GGGGGACCTT CGAGGGAGCA 



5151 GCGCTCTCCT GTTCCGACCC TCCCGCTTAC CGGATACCM TCCGCCTTTC 
CGCGAGAGGA CAAGGCTGGG ACGGCGAATG GCCTATGGAC AGGCGGAAAG 



5201 TCCCTTCGGG AAGCGTGGCC CTTTCXCAAT GCTCACGCTG TAGGTATCTC 
AGGGAAGCCC TTCGCACCGC GAAAGAGTTA CGAGTGCGAC ATCCATAGAG 



5251 AGTTCGGTGT AGGTCCTTCG CTCCAAGCTG CGCTGTGTGC ACGAACCCCC 
TCAAGCCACA TCCAGCAAGC GAGGTTCGAC CCGACACACG TGCT7GGGGG 
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S3 01 CGTTCACCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTCAGTCCA 
GCAAGTCGGC CTQGCC3ACGC GGAATAGGCC ATTOATAGCA GAACTCAGGT 



5351 ACCCGGTAAC AGACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACACG 
rGGGCCATTC TGTGCTGAAT AGCGGTGACC GTCGTCCCTG AOCATTCTCC 



5401 ATTAGCAGAG CGAGCTATGT AGCCGGTGCT ACAGAGTTCT TGAAGTGGTG 
TAATCGTCTC GCTCCATACA TCCGCCACGA TGTCTCAAGA ACTTCACCAC 



5451 GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC TGCGCTCTGC 
CGGATTGATG CCCATGTGAT CTTCCTCTCA TAAACCATAG ACGCGAGACG 



5501 TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTACCTCTTG ATCCGGCAAA 
ACTTCGGTCA ATGGAAGCCT TTTTCTCAAC CATCGAGAAC TAGGCCCTTT 



5551 CAAAOCACCC CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 
CTTTGGTGGC GACCATCGCC ACCAAAAAAA CAAACGTTCG TCGTCTAATG 



5601 CCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT 
CGCGTCTTTT TTTCCTAGAG TTCTTCTAGG AAACTAGAAA AGATGCCCCA 



5651 CTGACGCTCA CTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA 
GACTGCGAGT CACCTTGCTT TTGAGTGCAA TTCCCTAAAA CCAGTACTCT 



5701 TTATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA AATGAAGTTT 
AATAGTTTTT CCTAGAAGTC GATCTAGGAA AATTTAATTT TTACT7CAAA 



5751 TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC AGTTACCAAT 
ATTTAGTTAG ATOTCATATA TACTC AOTTG AACCAGACTG TCAATGGTTA 



5801 GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT TCCTCCATCC 
CGAATTAGTC ACTCCGTGGA CAGAGTCGCT AGACAGATAA AGCAAGTAGG 



5851 A m AGTTGCCT GACTCCGGGG GGGCCGGGCG CTGAGGTCTG CCTCGTGAAG 
TAVCAACCGA CTGAGGCCCC CCCCCCCCGC GACTCCAGAC GGAGCACTTC 



5901 AAGGTGTTGC 7GACTCATAC CAGGCCTGAA TCCCCCCATC ATCCAGCCAG 
TTCCACAACG ACTGAGTATG GTCCGGACTT AGCGGGGTAG TAGCTCGGTC 



5951 AAAGTGAGGG AGCCACGGTT GATGAGAGCT TTGTTCTAGG TGGACCAGTT 
TCTCACTCCC TCGGTGCCAA CTACTCTCCA AACAACATCC ACCTGGTCAA 



6001 GGTGATTTTG AACTXTTGCT TTGCCACGGA ACGGTC7GCG TTCTCGGGAA 
CCACTAAAAC TTGAAAACGA AACGGTGCCT TCCCAGACCC AACAGCCCTT 



6051 GATGCGTGAT CTGATCCTTC AACTCAGCAA AAGTTCGATT TATTCAACAA 
CTACGCACTA GACTAGGAAG TTGAGTCGTT TTCAAGCTAA ATAAGTTGTT 



6101 AGCCGCCGTC CCCTCAAGTC AGCGTAATGC TCTCCCAGTG TTACAACCAA 
TCGGCGGCAG CGCAGTTCAG TCGCATTACG AGACGGTCAC AATGTTCGTT 



6151 TTAACCAATT CTGATTAGAA AAACTCATCG AGCATCAAAT GAAACTGCAA 
AATTGGTTAA GACTAATCTT TTTGAGTAGC TCGTAGTTTA CTTTGACGTT 



6201 TTTATTCATA TCAGGATTAT CAATACCATA OTTTTGAAAA AGCCGTTTCT 
AAA7AAGTAT AGTCCTAATA GTTATGGTAT AAAAACTTTT TCGGCAAAGA 
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6251 


CTAATGAAGG AGAAAACTCA CCGAGGCAGT TCCATAGGAT GGCAAGATCC 
CATTACTTCC TCTTTTGAG? GGCTCCGTCA AGGTATCCTA CCGTTCTAGC 




6301 


TGGTATCGGT CTGCOATTCC GACTCGTCCA ACATCAATAC AACCTATTAA 
ACCATAGCCA GACGCTAAGG CTGACCAGG? TGTAGTTATC TTGGATAATT 




6351 


TTTCCCCTCG TCAAAAATAA GGTTATCAAG TGAGAAATCA CCATGAGTGA 
AAAGGGGAGC AGTTTTTATT CCAATAGTTC ACTCTTTAGT GGTACTCACT 




6401 


HindlXI 

CGACTGAATC CGGTGACAAT GGCAAAAGCT TATGCATTTC TTTCCAGACT 
GCTCACTTAG GCCACTCTTA CCGTTTTCGA ATACGTAAAG AAAGGTCTGA 




6451 


TGTTCAACAG GCCAGCCATT ACGCTCGTCA TCAAAATCAC TCGCATCAAC 
ACAAGTTGTC CGGTCGGTAA TGCGAGCACT AGTTTTAGTG AGCCTAGTTG 






PVUl 




6501 


CAAACCGCTA OTCATTCGTG ATTGCGCCTG AGCGAGACGA AATACGCGAT 
GTTTGGCAAT AAGTAAGCAC TAACGCGGAC TCCCTCTGCT TTATGCGCTA 




6551 


Pvul 

CCCTGTTAAA AGGACAATTA CAAACAGGAA TCGAATCCAA CCGGCGCAGG 
GCGACAATTT TCCTGTTAAT GTTTGTCCTT AGCTTACGTT GGCCGCGTCC 




6601 


AACACTGCCA GCGCATCAAC AATATTTTCA CCTGAATCAG CATATTCTTC 
TTGTCACGGT CGCGTAGTTG TTATAAAAGT CCACTTAGTC CTATAAGAAG 




6651 


TAATACCTGG AATGCTGTTT TCCCCCGGAT CGCAGTGGTC AGTAACCATG 
ATTATGGACC TTACGACAAA AGGGCCCCTA GCGTCACCAC TCAT7GGTAC 




6701 


CATCATCAGG AGTACGGATA AAATGCTTGA TGGTCGGAAG AGGCATAAA? 
GTAGTAGTCC TCATGCCTAT TTTACGAACT ACCAGCCTTC 7CCGTATTTA 




6751 


TCCGTCAGCC AGTTTAGTCT GACCATCTCA 7CTGTAACAT CATTGGCAAC 
AGGCAGTCGG TCAAATCAGA CTGGTAGAGT AGACATTGTA GTAACCCTM 




6801 


GCTACCTTTG CCATGTTTCA GAAACAACTC TGGCGCATCG GGCTTCCCAT 
CGATGGAAAC GGTACAAAGT CTTTGTTCAG ACC GCGTAGC CCGAAGGGTA 




6851 


ACAATCGATA GATTGTCGCA CCTGATTCCC CGACATTATC CCGAGCCCAT 
TGTTAGCTAT CTAACAGCGT GGACTAACGG CCTGTAATAG CGCTCGGGTA 






Xhol 




6901 


TTATACCCAT ATAAATCAGC ATCCATGTTG GAATTTAATC GCGGCCTCGA 
AATATGGGTA TATTTAGTCG TACGTACAAC CTTAAATTAG CGCCGGAGCT 





Xhox 



6951 GCAAGACGTT TCCCGTTGAA TATGGCTCAT AACACCCCTT GTATTACTGT 
CCTTCTGCAA AGGGCAACTT ATACCGAGTA TTGTGGGCAA CATAATGACA 



7001 TTATGTAAGC AGACAGTTTT ATTGTTCATG ATGATATATT TTTATCTTGT 
AATACATTCG TCTGTCAAAA TAACAAGTAC TACTATATAA AAATAGAACA 
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Drain 

70S1 GCAATGTAAC ATCAGAGATT TTGAGACACA ACGTGGCTTT CCCCCCCCCC 
CGTTACATTG TAGTCTCTAA AACTCTGTGT TGCACCGAAA CGG6GGGGGG 



7101 CCATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT 
GGTAATAACT TCGTAAATAO TCCCAATAAC AGAGTACTCC CCTATGTATA 



7X51 TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC 
AACTTACATA AATCTTTTTA TTTGTTTATC CCCAAGGCGC GTGTAAAGGG 



7201 CGAAAAGTGC CACCTGACGT CTAAGAAACC ATTATTATCA TGACATTAAC 
GCTTTTCACG GTGGACTGCA GATTCTTTGG TAATAATAGT ACTGTAATTG 



7251 CTATAAAAAT AGGCGTATCA CGAGGCCCTT TCCTC 
GATATTTTTA TCCGCATAGT CCTCCGGGAA AGCAG 



WO 99/32147 



34 



O "3 O ..1. 3 O !i 3 . O-e .1. .7" Oi 

PCT/US98/27364 



General Description —'^^U^ > 

BKX dVR 1012-SCPfZ) U ^ 



OKX pVR lOia-SCP(Z) 
Local object 
Cxeatedi 09/14/98 04s29FM 
Laat Modified! 09/15/98 04 1 50PM 
length: 7272 bp 
storage type: Basic 
formi Circular 
Ccnrnents 

Restriction Map 

uraiu. l sue GXCTHNCAC 
Hrndllhtsite 

Hpatnsrte «t*ac 

Kpnl:1site <g»« 

NotI:ls«e g^iigg 
PrntlHsite 

PvuMsite *™g» 

Sacll: 1 site ggggg! 

Xbainsite *ggg£ 

Xho!:1stte <=tcgac 
EcoRV:2sites ™g« 
Ncol:2srtes ™« 
Nde!:2sites g$g*g 
Spht:2sites gggjg 

Functional Map 

CDS (4 signals) 

CMV IE 5' UT 

Starts 886 End* 1129 
CMV IE INT 

Starts 1130 Eadi 1840 

TbQH 

Start: 4289 End: 4841 
Kanr 

Starts 6337 End: £959 (complementary) 

Miscjeature (2 signals) 
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CMV enhancer 

Start: 248 End: 885 
SGP(Z) 

Startx 1870 End* 4288 



Annotations 
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X 


TCCCGCGTTT 
ACCGCGCAAA 


CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG 
GCCACTACTG CCACTTTTGG AQACTCTGTA CGTCGAGGGC 


51 


GACACGG7CA 
CTCTGCCAGT 


CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 
GTCGAACAGA CATTCGCCTA CGGCCCTCGT CTGTTCGGGC 


101 


TCAGGGCGCG TCAGCGGGTG TTGGCGGGTC TCGGGGCTCG CTTAACTATG 
ACTCCCGCCC AGTCGCCCAC AACCGCCCAC ACCCCCGACC GAATTCATAC 






Ndel 


151 


CGGCATCACA 
CCCGTACTCT 


GGACATTGTA CTGAGAGTGC ACCATATGCG GTGTGAAATA 
CGTCTAACAT GACTCTCACG TGCTATACGC CACACTTTAT 


201 


CCGCACAGAT 
GGCCTGTCTA 


GCGTAAGGAG AAAATACCGC ATCAOATTGG CTATTGGCCA 
CGCATTCCTC TTTTATGGCG TAGTCTAACC GATAACCGGT 


251 


TTGCATACGT TGTATCCATA TCATAATATG TACATTTATA TTGGCTCATG 
AACGTATGCA ACATAGGTA7 ACTATTATAC ATGTAAATAT AACCGAGTAC 


301 


TCCAACATTA CCGCCATGTT GACATTGATT ATTGACTAGT TATTAATAGT 
ACGTTGTAAT GGCCGTACAA CTG7AACTAA TAACTGATCA ATAATTATCA 


351 


AATCAATTAC 
TTAGTTAATG 


GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCCOT 
CCCCAGTAAT CAAGTATCGG CTATATACCT CAAGGCGCAA 


401 


acataactta 
tgtattgaat 


CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 
GCCATTTACC GGGCGGACCG ACTGGCGGGT 7GCTGGGGGC 


451 


CCCATTGACG 
GGGTAACTGC 


TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 
AGTTATTACT CCATACAAGG GTATCATTGC GCTTATCCC? 


501 


CTTTCCATTC 
GAAAGGTAAC 


ACGTCAATGG GTGGAGTATT TACGGTAAAC TGCCCACTTG 
TGCAGTTACC CACCTCATAA ATGCCATTTG ACGGGTGAAC 






Ndcl 


551 


GCACTACATC 
CGTCATGTAG 


AAGTCTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA 
TTCACATAC? ATACGCTTCA TCCGGGGGAT AACTGGAGTT 


601 


TGACGGTAAA 
ACTGCCATTT 


TGGCCCGCCT GGCATTATGC CCAGTACATC ACCT7ATGGG 
ACCGGGCGGA CCGTAATACG GGTCATGTAC TCGAATACCC 






Nool 


651 


ACTTTCCTAC 
TCAAAGGATG 


TTGGCACTAC ATCTACGTAT TAGTCATCGC TATTACCATG 
AACCGTCATG TAGATGCATA ATCAGTAGCG ATAATGCTAC 




NCOI 




701 


OTGATGCGGT TTTGGCAGTA CATCAATGGG CGXCGATAGC GGTTTGACTC 
CACTACGCCA AAACCGTCAT GTAGTTACCC GCACCTATCG CCAAACTGAG 



751 ACCGGGATTT CCAAG^CTCC ACCCCATTGA CGTCAATCGG AGTTTCTTTT 
TGCCCCTAAA GGTTCAGAGG TGGGGTAACT GCAGTTACCC TCAAACAAAA 



601 GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA 
CCCTGGTTTT AGTTGCCCTG AAAGG7TTTA CAGCATTCTT GAGGCGGGGT 
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851 TTCACGCAAA TGGGCCCTAG GCCTCTACGG TGGGAGCTCT ATATAAGCAG 
AACTGCGTTT ACCCGCCATC CGCACATCCC ACCCTCCAGA TATATTCGTC 



901 AGCTCGTTTA GTCAACCGTC A3ATCGCCTG GAGACCCCAT CCACGCTGTT 
TCGAGCAAAT CACTTGGCAG TCTAGCGGAC CTCTGCGGTA GGTGCGACAA 



sacix 

951 TTGACCTCCA TAGAAGACAC CGGGACCGAT CCAGCCTCCG CGGCCGGGAA 
AACTCGAGGT ATCTTCTGTG GCCCTGGCTA GCTCGGACGC GCCGGCCCTT 



1001 CGGTGCATTG GAACGCCGAT TCCCCGTGCC AAGAG7GACC TAAGTACCGC 
GCCACGTAAC CTTGCCCCTA AGGGGCACGG TTCTCACTGC ATTCATGGCG 



SphI 

1051 CCATAGACTC TATACCCACA CCCCTTTGGC TCTTATCCAT GCTATACTGT 
GATATCTGAG ATATCCGTGX GGGGAAACCG AGAATACGTA CGATATGACA 



1101 TTTTGGCTTG GGGCCTATAC ACCCCCGCTT CCTTATGCTA TAGGTGATGG 
AAAACCGAAC CCCGGATATG TGGGGGCGAA GGAAXACGAT ATCCACTACC 



1151 TATAGCTTAG CCTATAGGTG TGCGTTATTG ACCATTATTG ACCACTCCCC 
ATATCGAATC GGATATCCAC ACCCAATAAC TGGTAATAAC 7GGTGAGGGG 



1201 TATTGGTGAC GATACTTTCC ATTACTAATC CATAACATGG CTCTTTCCCA 
A7AACCACTG CTATGAAAGG TAATGATOAG GTATTGXACC GAGAAACGGT 



1251 CAACTATC7C TATTGGCTAT ATGCCAATAC TCTGTCCTrC AGAGACTGAC 
GTTGATAGAG ATAACCGATA TACCGTTATG AGACAGGAAG TCTCTGACTG 



1301 ACGGACTCTG TATTTTTACA GGATGGGGTC CCATOTATTA TTTACAAATT 
TGCCTGAGAC A7AAAAATGT CCTACCCCAG GGTAAATAAT AAATGTTTAA 



1351 CACATATACA ACAACGCCGT CCCCCGTGCC CGCAGTTTTT ATTAAACATA 
CTGTATATGT TGTTGCGGCA GGGGGCACGG GCGTCAAAAA TAATTTGTAT 



1401 GCGTGGGATC TCCACGCGAA TCTCGGGTAC GTGTTCCGGA CATGGGCTCT 
CGCACCCTAG AGGTGCGCTT AGAGCCCATG CACAAGGCCT GTACCCGAGA 



1451 7CTCCGGTAG CGGCGGAGCT TCCACATCCG AGCCC7GGTC CCATGCCTCC 
AGAGCCCATG GCCGCCTCGA AGGTGTAGGC TCGGGACCAG GGTACGGAGG 



1501 AGCGGCTCAT CGTCGCTCGG CAGCTCCTTG CTCCTAACAG TGGAGGCCAG 
TCCCCGAGTA CCAGCGAGCC GTCGAGGAAC GAGGATTGTC ACCTCCGGTC 



1551 ACTTAGCCAC AGCACAATGC CCACCACCAC CAGTGTGCCC CACAAGCCCG 
TOAATCCGTG TCGTGTTACG GGTGGTGGTC GTCACACGGC GTGTTCCGGC 



1601 TCGCGGTAGG GTATGTGTCT GAAAATGAGC GTGCAGATTG GGCTCGCACG 
ACCGCCATCC CATACACAGA CTTTTACTCG CACCTCXAAC CCGAGCGTGC 



1651 GCTGACGCAG ATGGAAGACT TAAGGCAGCG GCAGAAGAAG ATGCAGGCAG 
CCACTGCGTC TACCTTCTGA ATTCCGTCGC CGTCTTCTTC TACGTCCCTC 



1701 CTGAGTTGTT GTATTCTGAT AAGAGTCAGA GGTAACTCCC CTTGCGGTCC 
GACTCAACAA CATAAGACTA TTCTCAGTCT CCATTGAGGG CAACGCCACG 
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Hpal 

-»751 TOTTAACGGT GGACGGCAGT GTAGTCTGAG CAGTACTCG7 TGCTGCCCCG 
ACAATTCCCA CCTCCCGTCA CATCAGACTC GTCATGAGCA ACGACCGCGC 

Ncol 

1B01 CGCCCCACCA GACATAATAG CTGACAGAC? AACAGACTGT TCCTTTCCAT 
CCGCGGTCGT CrCTATTATC GACTGTCTGA TTCTCTGACA AGGAAAGGTA 



NCOl 



Pxall EcoRVNotl 



lfi51 GGGTCTTTTC TGCAGTCACC GTCGTCGACA CGTCTCATCA CATATCGCGG 
CCCAGAAAAQ ACCTCAGTGC CAGCAGCTGT GCACACTAGT CTATAGCGCC 

NotI XbaX 

1901 CCGCTCTAGA CCAGGCGCCT GGATCGAATT GATGAAGATT AACCCGACAG 
GGC3AGATCT GGTCCGCGGA CCTAGCCTAA CTACTTCTAA TTCGGCTGTC 

1951 TGACCGTAAT CTTCATCTCT CTTAGATTAT TTGTTTTCCA GAGTAGGGGT 
ACTCGCATTA GAAGTAGAGA GAATCTAATA AACAAAAGGT CTCATCCCCA 

2001 CGTCAGGTCC TTTCCAATCG TGTAACCAAA ATAAACTCCA CTAGAAGGAT 
CCAGTCCAGG AAAAGTTAGC ACATTCGTTT 7ATTTGAGGT GATCTTCCTA 

2051 At?TGTGGGGC AACAACACAA TGGGCGTTAC AGGAATATTG CAGTTACCTC 
TAACACCCCG TTGTTG7GTT ACCCGCAATG TCCTTATAAC GTCAATGCAG 

2101 GTGATCGA7T CAAGAGGACA TCATTCTTTC TTTGGGTAAT TATCCTTTTC 
CAC7AGCTAA GTTCTCCTGT AGTAAGAAAG AAAOCCATTA ATAGGAAAAG 

2151 CAAAGAACAT T7TCCATCCC AC77GGAGTC ATCCACAATA GCACATTACA 
GT7TCTTGTA AAAGGTAGGG TGAACCTCAG TAGGTGTTAT CGTGTAATGT 

2201 GGTTAGTGA7 CTCGACAAAC TAGTTTGTCG 7GACAAACTG TCATCCACAA 
CCAATCACTA CAGCTCTTTG ATCAAACAGC ACTGTTTGAC AGTAGGTGTT 

2251 ATCAATTGAG ATCAGTTGGA CTGAATC7CG AAGGGAATGG AGTGGCAACT 
TAGTTAACTC TACTCAACC7 GACTTAGAGC TTCCCTTACC TCACCCTTGA 

23 01 GACGTGCCAT CTGCAACTAA AAGATGGGGC TTCAGGTCCG GTGTCCCACC 

CTGCACGGTA GACGTTGATT TTCTACCCCG AAGTCCAGGC CACAGGGTCG 

2"*51 AAAGGTGG7C AATTATGAAG CTGGTGAATG GGCTGAAAAC TGCTACAATC 
TTTCCACCAG TTAATACTTC GACCACTTAC CCGACTTTTG ACGATGTTAG 

24 0 1 TTGAAATCAA AAAACCTGAC GGGAGTGAGT GTCTACCAGC AGCGCCAGAC 

AACTTTAGTT TTTTGGACTG CCCTCACTCA CAGATCGTCG TCGCGGTCTG 

2451 GCGX77CGGG GCTTCCCCCG GTGCCGGTAT GTGCACAAAG TATCAGGAAC 
CCCTAAGCCC CGAAGGGGGC CACGGCCATA CACGTGTTTC ATAGTCCTTG 

2501 GCCACCGXGT GCCGGAGACT TTCCCTTCCA TAAAGAGGGT GCTTTCTTCC 
CCCTGGCACA CGGCCTCTGA AACGGAAGGT ATTTCTCCCA CGAAAGAAGG 



2551 TGTATGATCG ACTTGCTTCC ACAGTTATCT ACCCAGGAAC GACTTTCGCT 
ACA7ACTAGC TGAACGAAGG TGTCAATAGA TGGCTCCTTG CTGAAAGCGA 
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2601 GAAGGTCTCG TTGCATTTCT GATACTGCCC CAMSCT AAOA XGGACTTCTT 
CTTCCACAGC AACGTAAAGA CTATGACGCG GTTCGATTCT TCCTGAAGAA 

2651 " CAGCTCACAC CCCTTGAGAG AGCCGGTCAA TGCAACGGAC GACCCCTCTA 
GTCCAGTGTC GGGAACTCTC TCGGCCAGTT ACGTTGCCTC CTGGGCAGAT 

, EcoRV 

2701 GTGGCTACTA TTCTACCACA ATTAGATATC AGGCTACCGG TTTTGGAJ^C 
CACCGATGAT AAGATGGTGT TAATCTATAG TCCGATGGCC AAAACCTTGO 

27S1 AATGAGACAG ACTACTTCTT CGAGGTTGAC AATTTGACCT ACGTCCAACT 
TTACTCTGTC TCATCAACAA CCTCCAACTG •TTAAACTGGA TGCACGTTGA 

2801 TGAATCAAGA TTCACACCAC AGTTTCTCCT CCACCTOAA* <^£J^TAT 
ACT7ACTTCT AAGTGTCCTG TCAAAGACGA GGTCGACTTA CTCTGTTATA 

2851 ATACAAGTGG GAAAAGGAGC AATACCACGG GAAAACTAAT TTCGAAGGTC 
7ATGTTCACC CTTTTCCTCG TTATGGTGCC CTTTTGATTA AACCTTCCAG 

2901 AACCCCGAAA TTGATACAAC AATCGGGGAG TGGGCCTTCT GGCAAACTAA 
TTGGCGCTTT AACTATCTTG TTAGCCCCTC ACCCGGAAGA CCCTTTGATT 

29 SI AAAAACCTCA CTAGAAAAAT TCGCAGTGAA GAGTTGTCTT TCACAGTTGT 
TTTTTGGAGT GATCTTTTTA ACCGTCACTT CTCAACAGAA AGTGTCAACA 

3001 ATCAAACGGA GCCAAAAACA TCAGTGGTCA GAGTCCGGCC CGAACTTCTT 
TAGTTTGCCT CGGTTT7TCT AGTCACCAGT CTCAGGCCGC GCTTGAAGAA 

3051 CCCACCCAGG GACCAACACA ACAACTGAAG ACCAC AAAAT CATCCCTTCA 
CGCTGGGTCC CTGGTTGTGT TGTTGACTTC CGGTGTTTTA GTACCGAAGT 

3101 GAAAATTCCT CTGCAATGGT TCAAGtGCAC ACTCAAGGAA GGGAAGCTGC 
CTTTTAACGA GACGTTACCA AGTTCACGTG TCAGTTCCTT CCCTTCGACG 

3151 AGTGTCGCAT CTAACAACCC TTCCCACAAT CTCCACGAGT CCCCAATCCC 
TCACAGCGTA GATTGTTGGG AACGGTGTTA GACGTGCTCA GGGGTTAGGG 

3201 TCACAACCAA ACCAGGTCCG GACAACAGCA CCCATAATAC ACCCGTGTAT 
AGTGTTGGTT TGGTCCAGGC CTGTTGTCGT CGGTATTATG TGGGCACATA 

3251 AAACTTGACA TCTCTGAGGC AACTCAAGTT GAACAACATC ACCCCAGAAC 
TTTGAACTGT AGAGACTCCG TTGAGTTCAA CTTGTTGTAG TGGCGTCTTG 

3301 ACACAACGAC AGCACAGCCT CCGACACTCC CTCTGCCACG ACCCCAGCCG 
TCTGTTGCTG TCGTGTCGGA GGCTGTGAGC GAGACGGTGC TGGCGTCGGC 

3351 GACCCCCAAA AGCAGAGAAC ACCAACACGA CCAAGAGCAC TGACTTCCTC 
CTGGGGGTTT TCGTCTCTTG TCGTTGTGCT CGTTCTCGTC ACTGAAGGAC 

3401 GACCCCGCCA CCACAACAAG TCCCCAAAAC CAGAGCGAGA CCGCTGGCAA 
CTGGGGCGGT GCTCTTCTTC AGGCGTTTTG GTGTCCCTCT GGCGACCGOT 

3451 CAACAACACT CATCACCAAG A7ACCGGAGA AGAGAOTGCC AGCAGCGGGA 
GTTGTTGTGA GTAGTGGTTC TATGGCCTCT TCTCTCACGG TCGTCCCOCT 
> 
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3501 AGCTAGGCTT AATTACCAAT ACTATTGCTG CAGTCGCAGG ACTGATCACA 
TCGATCCGAA TTAATGGTTA TGATAACGAC CTCAGCGTCC TGACTAGTCT 



3551 GGCGGGAGAA GAACTCGAAG AGAAGCAATT GTCAATGCTC AACCCAAATG 
CCGCCCTCTT CTTGAGCTTC TCTTCGTTAA CAGTTACGAG TTCGGTTTAC 



3601 CAACCCTAAT TTACATTACT GGACTACTCA GGAIGAACGT GCTGCAATCG 
GTTGGGATTA AATGTAATGA CCTGATGACT CCTACTTCCA CCACGTTACC 



36S1 GACTGCCCTG CATACCATAT 7TCGGGCCAG CAGCCGAGGG AATTTACATA 
CTGACCGGAC CTATGGTATA AAGCCCGGTC GTCGGCTCCC TTAAATGTAT 



3701 GAGGGGCTAA TCCACAATCA AGATGGTTTA ATCTGTGGCT TGAGACAGCT 
CTCCCCGATT ACGTGTTACT TCTAGCAAAT TAGACACCCA ACTCTGTCGA 



3751 GGCCAACGAG ACGACTCAAG CTCTTCAACT GTTCCTGAGA GCCACAACTG 
CCGGTTCCTC TCCTGAGTTC GAGAAGTTGA CAAGGACTCT CCGTGTTGAC 



3801 AGCTACGCAC CTMTCAATC CTCAACCGTA AGGCAATTGA TTTCTTGCTG 
TCGATCCGTG GAAAAOTTAG GAGTTOOCAT TCCGTTAACT AAAGAACCAC 



3851 CAGCGATGGG GCGGCACATG CCACATTCTG GGACCGGACT GCTGTATCGA 
GTCCCTACCC CGCCGTGTAC CGTG7AAGAC CCTGGCCTGA CGACATAGCT 



3901 ACCACATGAT TGGACCAAGA ACATAACAGA CAAAATTGAT CAGATTATTC 
TGGTCTACTA ACCTGC7TCT TGTATTGTCT GTTTTAACTA GTCTAATAAG 



3951 ATGATTTTGT TGATAAAACC CTTCCGGACC AGGGGGACAA TGACAATTGG 
TACTAAAACA ACTATTTTGG GAAGGCCTGG TCCCCCTGTT ACTGTTAACC 



4001 TGGACAGGAT GGAGACAATG GATACCGGCA GGTATTGGAG TTACAGGCGT 
ACCTGTCCTA CCTCTGTTAC CTATCCCCGT CCATAACCTC AATGTCCGCA 



4051 TATAATTCCA GTTATCGCTT TATTCTGTAT ATGCAAATTT GTCTTTTAGT 
ATATTAACGT CAATAGCGAA ATAAGACATA TACGTTTAAA CAGAAAATCA 



4101 TTTTCTTCAG ATTGCTTCAT GGAAAAGCTC AGCCTCAAAT CAATGAAACC 
AAAAGAAGTC TAACGAAGTA CCTTTTCGAG TCCGAGTTTA GTTACTTTGG 



41S1 AGGATTTAAT 7ATATGGATT ACTTGAATCT AAGATTACTT GACAAATGAT 
TCCTAAATTA ATATACCTAA TGAACTTAGA TTCTAATGAA CTGTTTACTA 



4201 AATATAATAC ACTGGAGCTT TAAACATAGC CAATGTGATT CTAACTCCTT 
TTATATTATC 7GACCTCGAA ATTTGTATCG GTTACACTAA GATTCAGGAA 



4251 TAAACTCACA GTTAATCATA AACAAGGTTT CGAATTGATC TGCTGTGCCT 
ATTCCAGTGT CAAOTACTAT TTGTTCCAAA CCTTAACTAG ACGACACGGA 



4301 TCTAGTTGCC AGCCATCTGT TCTTTGCCCC TCCCCCGTGC CTTCCTTGAC 
AGATCAACGC TCGGTAGACA ACAAACGGGG AGGGGGCACG GAAGGAACTG 



4351 CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG 
CGACCTTCCA CCGTGAGGGT GACAGGAAAG GATTATTTTA CTCCTTTAAC 



4401 CATCGCATTG TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGCGGTGGGG 
CTAGCGTAAC AGACTCATCC ACAGTAAGAT AAGACCCCCC ACCCCACCCC 



1 
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4451 CAjCCACAGCA agggcgagga ttgggaagac aatagcaggc atgctgggga 

" OTCGTCTCGT TCCCCCTCCT AACCCTTCTG TTATCGTCCG TACOACCCCT 

Kpnl 

* 4501 TGCGGTGGGC TCTATGGGTA CCCAGCTGCT GAAGAATTGA CCCGGTTCCT 
ACGCCACCCG AGATACCCAT GGGTCCACGA CTTCTTAACT GGGCCAAGGA 

4551 CCTGGGCCAG AAAOAAGCAG GCACATCCCC TTCTCTGTGA CACACCCTGT 
CGACCCGGTC TTTCTTCGTC CGTCTAGGGG AAGAGACACT GTGTCGGACA 

4G01 CCACCCCCCT GGWCTTAGT TCCAGCCCCA CTCATAGGAC ACTCATAGCT 
GCTGCGCCGA CCAAGAATCA AGGTCCCCGT GACTATCCTG TGAG7ATCGA 

46S1 CAGGAGGGCT CCGCCTTCAA TCCCACCCGC TAAAGTACTT GGAGCGGTCT 
GTCCTCCCGA GGCGGAAGTT AGGOTGGGCG ATTTCATGAA CCTCGCCAGA 

4701 CTCCCTCCCT CATCAGCCCA CCAAACCAAA CCTAGC«CC AAGAGTGGGA 
GAGGGAGGGA GTAGTCGGGT GGTTTGGTTT GGATCCGAGG TTCTCACCCT 

4751 AGAAATOAAA CCAAGATAGG CTATTAAGTC CAGAGGGAGA GAAAATGCCT 
TC1TTAATTT CGTTCTATCC GATAATTCAC GTCTCCCTCT CTTTTACGGA 

4B0I CCAACATGTG AGGAACTAAT CAGAGAAATC ATACAATTTC TTCCGCTTCC 
GGT7G7ACAC TCCTTCATTA CTCTCTTTAG TATCTTAAAG AACGCGAAGG 

AflSl TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC 
AGCGAGTGAC TGAGCGACGC GAGCCAGCAA GCCGACGCCG CTCGCCATAG 

4901 AGCTCACTCA AAGCCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 
TCGAGTGAGT TTCCGCCATT ATGCCAATAG GTGTCTTAGT CCCCTAWGC 

4951 CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA 
GTCCTTTCTT GTACACTCGT TTTCCGCTCG TTTTCCGGTC CTTGGCATTT 

50 01 AAGGCCGCGT TGCTGGCGTT TOTCCATAGG CTCCGCCCCC CTGACGAGCA 
TTCCGGCGCA ACGACCCCAA AAAGGTATCC GAGGCGGGGG GACTGCTCGT 

5051 TCACAAAAAT CGACGCTCAA GTCAGAGCTG GCCAAACCCG ACAGGACTAT 
AGTGTTTT7A GCTGCGAGTT CAGTCTCCAC CGCTT*CCGC TCTCCTGATA 
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5351 CCACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA 
GCTGAATAGC GGTGACCGTC GTCGGTGACC ATTGTCCTAA TCCTCTCGCT 



5401 GGTATGTAGG CGGTOCTACA GAGTTCTTGA ACTGOTGGCC TAACTACCGC 
CCATACATCC CCCACGATCT CTCAAGAACT TCACCACCCG ATTCATGCCG 



5451 TACACTAGAA GGACAGTATT TGGTATCTGC CCTCTGCTGA AGCCAGTTAC 
ATGTGATCTT CCTGTCATAA ACCATAGACC CGAGACGACT TCGCTCAATG 



5501 CTTCGGAAAA AGAGTTGGTA CCTCTTGATC CGGCAAACAA ACCACCCCTG 
GAAGCCTTTT TCTCAACCAT CGAGAACTAG CCCGTTTGTT TGGTGGCGAC 



5551 GTAGCGGTGG TTTTTTT G TT TGCAACCAGC AGATTACGCG CAGAAAAAAA 
CATCGCCACC AAAAAAACAA ACGTTCGTCG TCTAATGCGC GTCTTTTTTT 



5601 GGATCTCAAG AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG 
CCTAGAGTTC TTCTAGGAAA CTACAAAAGA TCCCCCAGAC TGCGAGTCAC 



5651 GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA TCAAAAAGGA 
CCTGCTTTTG AOTGCAACTC CCTAAAACCA GTACTCTAAT AGTTTTTCCT 



5701 TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 
AGAAGTGGA? CTAGGAAAAT TTAATTTTTA CTTCAAAATT TAGTTAGATT 



5751 AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA 
TCATATATAC TCATTTGAAC CAGACTCTCA ATCGCTACGA ATTAGTCACT 



5801 GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 
CCGTGGATAG AGTCGCTAGA CAGATAAAGC AAGTAGGTAT CAACGGACTG 



5851 TCCGGGGGGG GGGGGCGCTG AGGTCTGCCT CGTGAAGAAG GTGTTGCTGA 
AGGCCCCCCC CCCCCGCGAC TCCAGACGGA GCACTTCTTC CACAACGACT 



5901 CTCATACCAG GCCTGAATCG CCCCATCATC CACCCA GAAA GTGAGGGAGC 
GAGTATGGTC CGGACTTAGC GGGGTAGTAG GTCGGTCTTT CACTCCCTCG 



5351 CACCG7TGAT GAGAGCTTTG TTGTAGGTGG ACCAGTTGGT CA7TTTGAAC 
GTCCCAACTA CTCTCGAAAC AACATCCACC TGGTCAACCA CTAAAACTTG 



6001 TTTTGC7TTG CCACGGAACG GTCTGCGTTG TCGGGAAGAT GCGTGATCTG 
AAAACGAAAC GGTGCCTTGC CAGACCCAAC AGCCCTTCTA CGCACTAGAC 



6051 ATCCTTCAAC TCAGCAAAAG TTCGATTTA? TCAACAAAGC CGCCGTCCCG 
TAGGAAGTTG AGTCGTTTTC AAGCTAAATA AGTTGTTTCG GCGGCAGGGC 



6101 TCAAGTCAGC GTAATGCTCT GCCAGTCTTA CAACCAATTA ACCAATTCTG 
AGTTCAGTCG CATTACGAGA CGGTCACAAT GTTGGTTAAT TGGTTAAGAC 



6151 ATTACAAAAA CTCATCGAGC ATCAAATOAA ACTGCAATTT ATXCATATCA 
TAATCT'ITTT GAGTAGCTCG TAGTTTACW TGACCTTAAA TAAGTATAGT 



6201 GCATTATCAA TACCATATTT TTGAAAAAGC CGTTTCTGTA ATGAAGCAGA 
CCTAATAGTT ATGGTATAAA AACTTTTTCG GCAAAGACAT TACTTCCTCT 



6251 AAACTCACCG AGGCAGTTCC ATAGGATGGC AAGATCCTGG TATCGGTCTG 
TTTGACTGGC TCCGTCAAGG TATCCTACCG TTCTAGGACC ATAGCCAGAC 
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€301 


CGATTCCGAC TCGTCCAACA TCAATACAAC CTATTAATTT 
GCTAAGGCTG AGCAGGTTGT AGTTATCTTG GATAATTAAA 


CCCCTCCTCA 
GGGGAGCAGT 


6351 


AAAATAAGGT TATCAAGTCA GAAATCACCA TGAGTCACOA 
TTTTATTCCA ATAGTTCACT CTTTAGTGGT ACTCACTCCT 


CTGAATCCGG 
QACTTAGGCC 






Hindiiz 




6401 


TGAGAATGGC AAAAGCTTAT GCATTTCTTT CCAGACTTGT TCAACAGGCC 
ACTCTTACCG TTTTCGAATA CGTAAAGAAA GGTCTGAACA AGTTGTCCGG 


€451 


AGCCATTACG 
TCCGTAATCC 


CTCGTCATCA AAATCACTCG CATCAACCAA 
GAGCAGTAGT TTTAGTGAGC CTAGTTGGTT 


ACCGTTATTC 
TGGCAATAAG 






PvuZ 




€501 


ATTCGTGATT 
TAAGCACTAA 


GCGCCTGAGC GACACGAAAT ACGCGATCGC 
CCCGGACTCG CTCTGCTTTA TGCGCTAGCG 


TGTTAAAAGG 
ACAATTTTCC 


€551 


ACAATTACAA 
TGTTAATGTT 


ACAGGAATCG AATGCAACCG GCGCAGGAAC 
TGTCCTTAGC TTACGTTGGC CGCGTCCTTG 


ACTGCCAGCG 
TGACGGTCGC 


6601 


CATCAACAAT 
GTAGTTGTTA 


ATTTTCACCT CAATCAGCAT ATTCTTCTAA 
7AAAAGTGGA CTTAGTCCTA TAAGAAGATT 


TACCTGGAAT 
ATGGACCTTA 


€651 


GCTGTTTTCC 
CGACAAAAGC 


CGGGGATCCC AGTGGTGAGT AACCATGCAT 
GCCCCTAGCG TCACCACTCA TTGGTACGTA 


CATCACGAGT 
GTAGTCCTCA 


6701 


ACGGATAAAA TGCTTGATGG TCGGAAGAGG CATAAATTCC GTCAGCCAGT 
TCCCTATTTT ACGAACTACC AGCCTTCTCC GTATTTAAGG CAGTCGGTCA 


6751 


TTACTCTGAC CATCTCATCT GTAACATCAT TGGCAACGCT ACCTTTGCCA 
AATCAGACTG GTAGAGTAGA CATTGTAGTA ACCGTTGCGA TGGAAAGGGT 


6801 


TGTTTCAGAA ACAACTCTGG CGCA7CCCGC TTCCCATACA ATCGATAGAT 
ACAAAGTCTT TGTTGAGACC GCGTAGCCCG AAGGGTATGT TAGCTATCTA 


6851 


TGTCSCACCT GATTGCCCGA CATTATCGCG AGCCCATTTA 
ACAGCGTGGA CTAACGGGCT G7AATACCGC TCCGGTAAAT 


TACCCATATA 
ATGGGTATAT 






xhoi 




€901 


AATCAGCATC 
TTAGTCCTAG 


CATGTTGGAA TTTAATCGCG GCCTCGACCA AGACGTTTCC 
GTACAACCTT AAATTAGCGC CGGAGCTCGT TCTGCAAAGG 


6951 


CGTTGAATAT GGCTCATAAC ACCCCTTGTA TTACTGTTTA 
GCAACTTATA CCGAOTATTG TGGGGAACAT AATGACAAAT 


TGTAAGCAGA 
ACATTCGTCT 


7001 


CAGTTTTATT 
GXCAAAATAA 


CTTCATGATG ATATATTTTT ATCTTGTGCA 
CAACTACTAC TATATAAAAA TAGAACACGT 


ATGTAACATC 
TACATTGTAG 



Drain 



7051 AGAGATTTTG AGACACAACG TCGCTTTCCC CCCCCCCCCA TTATTGAAGC 
TCTCTAAAAC TCTGTGTTGC ACCGAAAGGG GGGGGGGGGT AATAACTTCG 



0 7101 ATTTATCAGG GTTATTGTCT CATGAGCGCA TACATATTTG AATGTATTTA 
TAAATAGTCC CAATAACAGA GTACTCGCCT ATCTAXAAAC TTACATAAAT 
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7151 CAAAA&TAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC 
CTTTTTATTT GTTTATCCCC AAOGCGCGTG TAAAQGGGCT TTTCACGGTO 



7201 CTGACCTCTA AOAAACCATT ATTATCATOA CATTAACCTA TAAAAATAGO 
GACTGCAGAT TCTTTGOTAA TAATAGTACT GTAATTGGAT ATTTTTATCC 



7251 CCTATCACGA GGCCCTTTCG TC 
GCATAGTCCT CC6G6AAAGC AO 



